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IN THE UNITED STATES DISTRICT COURT 
FOR THE DISTRICT OF COLUMBIA 



INSTITUT NATIONAL DES SCIENCES 

APPLIQUEES (INSA) 

135 Avenue De Rangueil 

31077 Toulouse 

France 



Plaintiff, 



HON. JOHN J. DOLL 

Under Secretary of Commerce for Intellectual 
Property and Director of the United States 
Patent and Trademark Office 
Madison Building 
600 Dulany Street 
Alexandria, Virginia 22314 



Civil Action No. 



COMPLAINT 

Plaintiff Institut National Des Sciences Appliquees (INSA) for its complaint 
against Defendant the Honorable John J. Doll, state as follows: 
1. This is an action by the owners of United States Patent No. 7,439,049 ("the 
"049 patent") seeking review of inaccurate and erroneous Patent Term 
Adjustment ("PTA") calculations made by the United States Patent & 
Trademark Office ("USPTO"). Specifically, this is an action by Plaintiffs under 
35 U.S.C. § 154(b)(4)(A) seeking a judgment that the patent term adjustment of 
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301 days calculated by the USPTO for the x 049 patent should be corrected to 755 
days. 

2. This action arises under 35 U.S.C. § 154 and the Administrative Procedure Act, 5 
U.S.C. §§ 701-706. 

I. THE PARTIES 

3. Plaintiff Institut National Des Sciences Appliquees is a company operating 
under the laws of France. Institut National Des Sciences Appliquees is located 
at 135 Avenue De Rangueil 31077 Toulouse, France. 

4. Defendant John J. Doll is the Under Secretary of Commerce for Intellectual 
Property and Director of the United States Patent and Trademark Office. 
Defendant is sued in his official capacity. 

II. JURISDICTION AND VENUE 

5. This Court has jurisdiction over this action and is authorized to issue the 
requested relief to Plaintiff pursuant to 28 U.S.C. §§ 1331, 1338(a) and 1361; 
35 U.S.C. § 154(b)(4)(A) and 5 U.S.C. §§ 701-706. 

6. Venue is proper in this district pursuant to 35 U.S.C. § 154(b)(4)(A). 

7. This Complaint is being timely filed in accordance with 35 U.S.C. 
§154(b)(4)(A). 

III. BACKGROUND 

8. The inventors of the '049 patent are Sophi Anne Michele Bozonnet, Magali 
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Martine Claude Remaud-Simeon, Rene-Marc Lucien Willemot, and Pierre 
Emmanuel Frederic Monsan. 

9. The '049 patent granted on October 21, 2008, based on patent application 
number 10/509,027, filed September 27, 2004. The '049 patent is attached 
hereto as Exhibit A. 

10. Plaintiff Institut National Des Sciences Appliquees are the assignees of the 
'049 patent, as evidenced by the Assignment recorded in the USPTO at 
Reel/Frame 01 8179/0440, and are the real parties in interest in this case. 

1 1 . When the USPTO issued the '049 patent on October 21, 2008, it erroneously 
calculated the entitled PTA for the '049 patent as 301 days. Had the USPTO 
calculated the entitled PTA properly, the '049 patent would be entitled to 755 
days of PTA. 

12. The errors in the USPTO's PTA calculations are detailed in a recent order from 
the U.S. District Court for the District of Columbia in an action titled Wyeth v. 
Dudas, Civil Action No. 07-1492 (D.D.C. Sept. 30, 2008) where the Court 
granted summary judgment against the USPTO, holding that the USPTO's PTA 
calculation methodology was erroneous as a matter of law and inconsistent with 
the Patent Statute. The Wyeth v. Dudas opinion is attached as Exhibit B. 

13. The correct PTA methodology identified in the prior Wyeth v. Dudas action 
governs the USPTO's calculation of PTA for Plaintiffs x 049 patent. 
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IV. COUNT I: U.S. PATENT NO. 7,439,049 

14. Plaintiff incorporates by reference the allegations in paragraphs 1-14 above, as 
if fully set forth herein. 

15. During prosecution of the "049 patent, the patent owners accrued 429 days of 
PTA under 35 USC § 154(b)(1)(A), and accrued 390 days of PTA under 35 
USC 154(b)(1)(B). 

1 6. Under the USPTOs interpretation of 35 USC § 1 54, all PTA accrued under 35 
U.S.C. § 154(b)(1)(A) and all PTA accrued under 35 USC § 154(b)(1)(B) 
inherently overlaps and, thus, it has been the USPTO position that a patent 
holder is only eligible for the larger of these two amounts of PTA. For the '049 
patent, the USPTO erroneously limited the PTA for the '049 patent to 301 days 
(see calculation in paragraph 19, below), as shown on the face of the '049 
patents 

17. In view of a recent decision from this Court (Wyeth v. Dudas, Civil Action 
No. 071492 (JR)), all days on which 35 USC 154(b)(1)(A) or 35 USC 
154(b)(1)(B) apply should accrue PTA for the x 049 patent. 

18. Under the interpretation of this Court (Wyeth v. Dudas, Civil Action No. 07- 
1492 (JR)), each day from the day after November 27, 2005 (14 months from 
the "049 patent application filing date) through to the grant date on October 21, 
2008, qualifies for PTA under 35 U.S.C. § 154(b)(1)(A), (41 1 days, see j 
calculation of paragraph 20, below) and each day from the day after Septemb© 
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27, 2007 (three years from the filing date) through to the grant date on October 
21, 2008 qualifies for PTA under 35 U.S.C. § 154(b)(1)(B) (390 days) for a 
total of 801 days. 

19. In calculating the time accrued under 35 USC 154 (b)(1)(B) the Plaintiff 
disagrees with the USPTO's holding of a total USPTO prosecution delay of 
301 days under 35 USC 154(b)(2)(B). The Plaintiff believes that the total 
USPTO prosecution delay should be 383 days under 35 USC 154(b)(2)(B) 
because the USPTO incorrectly deducted 82 days due to the filing of a 
"Letter" on May 30, 2008. The "Letter" of May 30, 2008 explained to the 
Examiner that the initial Information Disclosure Statement (IDS) filed on 
September 27, 2004 was in compliance with 37 CFR § 1.98. The USPTO 
incorrectly considered the "Letter", a Supplemental Reply to applicant's 
original request regarding the IDS that was filed April 10, 2008. However, 
the "Letter" was as noted above and was submitted to correct an error by the 
Examiner. Thus there should have been no 82 day deduction made to the 
accrued PTA of 390 days 

20. Plaintiff concede a two day delay under 35 USC 1 54(b)(l )(B) for the 
response filed December 26, 2007 to the Office Action issued September 24, 

2007, as well as a 44 day delay for paper filed after Allowance on May 23, 

2008. Plaintiff further concedes an overlap of 18 days of credits accrued 
under 35 USC 154(b)(1)(A). Thus the total PTA should be: 

a. 429 days credit under 35 USC 154(b)(1)(A) - 1 8 days overlapping = 41 1 
days of PTA under 35 USC 154(b)(1)(A). 
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b. 390 days credit under 35 USC 1 54(b)(1)(B) - 46 days delay by applicant = = 
344 days of PTA under 35 USC 154(b)(1)(B). 

c. Thus the total PTA is 41 1 days under 35 USC 1 54(b)(1)(A) + 344 days 
under 35 USC(b)(l)(B) = 755 days 

2 1 . Under the USPTO's interpretation, the USPTO had calculated an erroneous 
PTA of 429 - 1 28 = 301 days. 

21 . The Plaintiffs imposition of only 301 days of PTA for the '049 patent is 
arbitrary, capricious, and abuse of discretion, or otherwise not in accordance 
with law and in excess of statutory jurisdiction, authority or limitation. 

22. It is accordingly believed that the overall PTA accrued by the Plaintiff is 751 > 
days, and the patent holder accordingly requests 755 -301 = 454 
ADDITIONAL days of PTA. 




to reflect the 755 days of actual PTA due the '049 patent. 



from 301 days to 755 days and requiring Plaintiff to alter the terms of the '049 patent 



A. 



WHEREFORE, Plaintiffs respectfully pray that this Court: 



Issue an Order changing the period of PTA for the '049 patent term 



B. 



Grant such other and further relief as the nature of the case may 



admit or require and as may be just and equitable. 
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Dated: April 6, 2009 Respectfully submitted, 




:orrie (DC Bar No. 224469) 
BIRCH, STEWART, KOLASCH & BIRCH, LLP 
8110 Gatehouse Road, Suite 100 East 
Falls Church, Virginia 22042 
Attorney for Plaintiff 

Of Counsel, 

MaryAnne Armstrong, Ph.D. 
BIRCH, STEWART, KOLASCH & BIRCH, LLP 
8110 Gatehouse Road, Suite 1 00 East 
Falls Church, Virginia 22042 
Attorney for Plaintiff 
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ABSTRACT 



The invention relates to an isolated polypeptide with an gly- 
cosyl transferase enzymatic activity for producing dextrans 
with a(l -*2) sidechains, comprising at least one region for 
bonding to glucan and a catalytically active region situated 
beyond the region bonding to glucan. The invention further 
relates to polynucleotides coding for said enzymes and vec- 
tors containing the same. 

14 Claims, 6 Drawing Sheets 
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NUCLEIC ACID MOLECULES CODING FOR 
A DEXTRAN-SACCHARASE CATALYSING 
THE SYNTHESIS OF DEXTRAN WITH a 1,2 
OSIDIC SIDECHAINS 

5 

The present invention relates to the field of glycotechnol- 
ogy, more particularly to the synthesis of oligosaccharides or 
oligosides with a prebiotic, therapeutic or diagnostic effect. 

The present invention pertains to nucleic acid molecules 
encoding an enzyme having a glycosyltransferase activity 10 
catalyzing the synthesis of dextrans or oligosides carrying 
a(l-»>2) osidic type linkages. 

The invention also pertains to enzymes synthesized by the 
nucleic acids of the invention, and to their expression systems 1 5 
in prokaryotic or eukaryotic cells. Finally, they pertain to the 
use of said enzymes in the production of oligosaccharides in 
foodstuffs, or as an active principle in therapeutic and/or 
cosmetic products. 

Oligosides and heterooligosides act as recognition and 2 o 
effector signals in both animals and plants (as oligosaccha- 
rines)by specifically binding to lectins, glycosyltransferases, 
glycosidases, adhesion molecules etc. The antigenic determi- 
nants of blood groups are osidics and our defense against 
many pathogenic bacteria is directed against osidic structures 25 
of the bacterial envelope. Further, one of the major reasons for 
xenograft rejection is the existence of osidic structures 
belonging to each species. Such properties, and the knowl- 
edge acquired in recent years regarding glycosyltransferases 
and lectins, contribute to making certain oligosides the can- 30 
didates of choice for therapeutic or prophylactic treatment of 
disorders linked to the microbiological equilibrium of various 
organs such as the intestine or skin. As an example, oligosides 
constitute an interesting alternative to the use of micro-organ- 
isms and antibiotics in regulating the composition of intesti- 35 
nal flora (prebiotic effect). Certain oligosides can be consid- 
ered to be "soluble fiber*' when they are not metabolized by 
human and animal digestive enzymes; on reaching the colon, 
they interact with the microbial flora and specifically affect 
the growth and adhesion of certain species. If they are incor- 40 
porated into food in low doses (less than 1%), certain osidic 
molecules improve health and stimulate weight gain in ani- 
mals. 

A review of different glycosyltransferases, their structure 
and their activity, has been carried out by Vincent Monchois 45 
et al (1). Briefly: 

a) it appears that the structure of the glycosyltransferases 
and/or dextransucrases studied is highly conserved and is 
constituted, starting from the amino part of the protein, by a 
signal sequence, a variable domain, a catalytic domain and a 50 
glucan binding domain. 

b) glucooligosides (GOS) can be synthesized by glycosyl- 
transferases such as dextransucrases from cheaper substrates 
such as saccharose and in the presence of a glucose accepting 
sugar. Other substrates such as a-D-fluoroglucose, para-ni- 55 
trophenyl-a-D-glucopyranoside, a-D-glucopyranoside-a- 
D-sorbofuranoside or4-0-a-D-galactopyranosylsucrose can 
also be used. 

Starting from the substrate, such enzymes catalyze the 
transfer of glucose units onto acceptor molecules. In the 
presence of a glucose acceptor such as maltose or isomaltose, 
glycosyltransferases catalyze the synthesis of low molecular 
weight oligosaccharides primarily comprising chains with 3 
to 7 glucoses, in accordance with the reaction: 

Saocharosc+acccptor-^gluccsylated acceptor+fhic- 



60 



65 



In such cases, enzymes generally have a specificity for the 
synthesis of osidic bonds in accordance with that forming the 
donor polymer. 

In contrast, in the absence of an acceptor, the enzyme 
synthesizes high molecular weight dextran type glucans by 
successive transfer of a-D-glucopyranosyl units from sac- 
charose in accordance with the reaction: 

n-saccharosc-*Cg^Licosc) w +n-fructose 

c) The structures and function of glucans or oligosides 
synthesized by glycosyltransferases depends on the produc- 
ing bacterial strain. 

Throughout the present text, the generic term "glycosyl- 
transferases'* is used to designate the different enzymes 
capable of catalyzing the synthesis of glucose polymers from 
saccharose. They are generally produced by bacterial strains 
of the Leuconostoc, Lactococcus, Streptococcus ox Neisseria 
type. The size and structure of the glucans produced depends 
on the producing strain. 

The glucose units are coupled by a(l-»6) osidic bonds as 
in dextran, by a(l-*3) bonds as in the case of mutane, or by 
alternations of the two types (alternane). 

Similarly, the existence and nature of the linkages, their 
length and position varies depending on the origin of the 
producing strain. 

Glycosyltransferases producing glucans or GOSs contain- 
ing at least 50% a(l-»6) bonds are termed dextransucrases. 
GOSs synthesized by said enzymes may carry a(l-»2), 
a(l->3) and/or a(l->4) linkages. Said dextransucrases are 
produced by Leuconostoc mesenteroides type bacteria. 

d) The dextransucrase from L. mesenteroides NRRL 
B-1299 can produce a highly branched dextran the 
majority of linkages of which are of the a(l->2) type. 
Used in the presence of saccharose and maltose, a glu- 
cose acceptor molecule, it results in the formation of 
GOS some of which have a a(l-»2) bond at their non- 
reducing end and others of which have a( 1 ->2) linkages 
on intermediate residues between the ends. For this rea- 
son, they resist degradation by enzymes (hydrolases) of 
the upper digestive tract in man and animals, and are 
only degraded by bacterial genuses that are capable of 
fermenting, such as Bacteroides and Bifidobacterium, 
considered to be beneficial to the host organism. 

An identical phenomenon occurs in the skin, allowing cos- 
metic applications to be envi saged, since a lack of equilibrium 
of the cutaneous microbial flora is the root of numerous 
cosmetic and dermatological problems. For these reasons, 
they are designated "GOS of interest" in the present text. 

Throughout the text, polysaccharides synthesized by the 
glycosyltransferases of the invention are either high molecu- 
lar weight dextrans when the reaction is carried out without a 
glucose acceptor, or oligosides when the reaction is carried 
out in the presence of a glucose acceptor such as maltose or 
isomaltose without this necessarily being specified. The func- 
tionality of the enzyme is characterized by the nature of the 
glucose-glucose bonds, [a(l -»6), a(l-»2)] or others, and not 
by the molecular weight of the polysaccharide that is synthe- 
sized. 

dextransucrases from L. mesenteroides already have a 
number of applications in industry, and in particular those 
from the NRRL B-1299 strain for which a method for syn- 
thesizing GOSs having a(l -»2) linkages has been described 
in European patent EP-B1-0 325 872. 

Marguerite Dols et al (2) showed that the GOS produced 
dextransucrases from that strain are in fact a mixture of at 
least three similar families of molecules differing by the 
number and position of the a(l-*2) type linkages, which 
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leads to the hypothesis that different glycosyltransferase type 
enzymatic activities exist in that bacterial strain. 

Because of the industrial interest pertaining to GOSs with 
a(l -*2) linkages as summarized above in the field of prebi- 
otic foodstuffs, in cosmetics or in pharmaceuticals, the 
present invention aims to isolate and characterize a particular 
enzyme from among those produced by L mesenteroides 
NRRL B-1299 which more particularly would be involved in 
the synthesis of oligosides having a(l-*2) linkages. The 
identification and characterization of such an enzyme have 
the advantage firstly of providing a uniform, reproducible 
method for producing GOSs of interest and secondly of iden- 
tifying the essential characteristics of the producer enzyme 
for said GOSs of interest in order, if appropriate, to improve 
the performance of the products of the enzymatic reaction as 
a function of the envisaged use. 

The technical problem underlying the present invention is 
thus to provide an enzyme and hence isolated nucleic acids 
encoding said enzyme to allow the improved production of 
GOS having a(l -»2) linkages. 

The present invention provides a technical solution to the 
various questions mentioned above by providing a novel dex- 
transu erase, termed DSR-E, encoded by a gene endowed with 
a novel and unexpected structure (dsrE) capable of catalyzing 
the synthesis of glucans or oligosaccharides containing 
a(l-*2) linkages. 

Between the date of filing of the priority document, French 
patent number 0103631 in which the dextransucrase of the 
invention was termed DSR-D, and that of the present appli- 
cation, another dextransucrase, different from the enzyme of 
the invention, was described and also termed DSR-D. For this 
reason, in the present patent application, the dextransucrase 
described, claimed and shown in FIG. lb) is no longer des- 
ignated DSR-D as in the priority document, but is termed 
DSR-B. In fact, the DSR-D dextransucrases in said priority 
document and DSR-E are completely identical. 

The term "novel and unexpected structure" means that the 
organization of the protein differs from that of all other gly- 
cosyltransferases described until now (1) with a catalytic 
domain located upstream of a glucan binding domain, the 
latter constituting the carboxylic portion of the protein. 

The present invention thus concerns an isolated polypep- 
tide having an enzymatic glycosyltransferase activity capable 
of forming dextrans having a(l-*2) linkages, characterized 
in that it comprises at least one glucan binding domain and a 
catalytic activity domain located downstream of the glucan 
binding domain. The term "located downstream" means the 
fact that the amine portion of the sequence with catalytic 
activity or catalytic domain is proximal to the carboxylic 
portion of the glucan binding domain. These two domains can 
be immediately contiguous or, in contrast, they may be sepa- 
rated by a variable domain. 

The glycosyltransferase of the invention preferably com- 
prises a signal peptide. 

In one implementation of the invention, the glycosyltrans- 
ferase comprises two catalytic domains located either side of 
the glucan binding domain. 

The presence of a domain with catalytic activity in the 
carboxylic portion of the enzyme is an essential characteristic 
of the latter in its capacity to form osidic a(l-»2) bonds. In 
fact, as will be shown in the experiments described below, 
deletion of this domain in an enzyme having at least two 
catalytic domains results in the production of glucans or 
oligosides essentially having a(l-*6) type osidic bonds and 
free of a(l-*2) type bonds. 
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More precisely, the catalytic domain, as long as it is located 
downstream of a glucan binding domain, allows the synthesis 
of oligosides containing a(l ~*2) bonds. 

Further, the experiments described below demonstrate that 

5 the specificity of the dextransucrase DSR-E function, namely 
its capacity to catalyze the formation of a(l -*2) osidic bonds, 
can be attributed not to the concomitant presence of two 
catalytic domains but rather to the concatenation of a glucan 
binding domain and a catalytic domain, and more particularly 

10 the CD2 catalytic domain. 

A comparative analysis of the different glycosyltrans- 
ferases including dextransucrases has demonstrated a very 
high degree of conservation of their catalytic domain. 
The catalytic domain located in the carijoxy-terminal por- 

1 5 tion of the glycosyltransferase of the invention has a sequence 
having at least 44% identity and 55% similarity with the 
catalytic domains of the other analyzed glycosyltransferases. 
In particular, the catalytic domain in the carboxylic portion of 
the glycosyltransferase of the invention has at least 65% 

20 identity and at least 80% similarity with the SEQ ID No: 1, the 
catalytic triad Asp/Glu/Asp in respective positions 230/268/ 
342 being conserved. 

Throughout the text, the term "X %" similarity" with 
respect to a reference sequence means that X % of the amino 

25 acids are identical or modified by conservative substitution as 
defined in the ChistalW amino acid alignment software 
(http:///bioweb.pasteur.fr/docs/doc-gensoft/clustalw//) and 
that (100-X)% can be deleted, substituted by other amino 
acids, or that (100-X)% can be added to the reference 
sequence. A particular primary structure of the enzyme of the 
invention is shown in SEQ ID No: 2 which represent a 
sequence of 2835 amino acids of a dextransucrase of L. 
mesenteroides NRRL B-1299. 

35 This dextransucrase, denoted DSR-E, like most glycosyl- 
transferases and dextransucrases, has a signal sequence, a 
variable domain of low conservation, a highly conserved 
catalytic domain (CD1), a glucan binding domain (GBD) and 
a second catalytic domain (CD2) in the carboxylic portion of 
the protein. DSR-E is the first glycosyltransferase analyzed 
and has two catalytic domains, in the configuration shown in 
FIG. lb). It is also the first glycosyltransferase the catalytic 
domain of which is located in the carboxylic portion of the 
protein. 

j 5 FIG. lb) also shows that the glucan binding domain is 
substantially longer than that described above for known 
dextransucrases; thus, a further characteristic of the enzymes 
of the invention is the size of this domain which is over 500 
amino acids. 

50 A comparison and analysis of the DSR-E sequence with 
the sequences of the glycosyltransferases or dextransucrases 
that have already been described (1), and the means used to 
this end are indicated in Example 2 detailed below. It clearly 
shows that while the existence of two catalytic domains sub- 

55 stantially differentiates DSR-E from other enzymes, in con- 
trast the sequences of said domains are substantially con- 
served. In particular, the amino acids necessary for catalytic 
activity are conserved in the second catalytic domain, namely 
the triad Asp/Glu/Asp located in respective positions 2210/ 

60 2248/2322 of SEQ ID No: 2. 

Thus, the invention also concerns any isolated polypeptide 
having a catalytic glycosyltransferase activity that is capable 
of forming dextrans or oligosaccharides having a(l -»2) link- 
ages as obtained by modification, substitution, insertion or 

65 deletion of amino acid sequences but comprising sequences 
having at least 80% and preferably at least 90% similarity 
with the following sequences of SEQ ID No: 2: 
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423-439 


2120-2138 


478-501 


2161-2184 


519-539 


2202-2214 


560-571 


2243-2250 


631-645 


2315-2322 


1014-1021 


2689-2696 



Preferably, finally, a polypeptide with catalytic activity of 10 
the invention contains the following amino acids: 

W in positions 425 and 2122; 

E in positions 430, 565 and 2127, 2248; 

D in positions 487, 489, 527, 638, 2170, 2172, 2210 and 
2322; 15 

H in position 637 and 2321 ; 

Q in position 1 01 9 and 2694. 

The polypeptides with glycosyltransferase activity that can 
form osidic ct(l-*2) bonds can be in the isolated form or, in 
contrast, integrated into a larger protein such as a fusion 20 
protein. It may be advantageous to include sequences having 
another function, such as a specific tag sequence of a ligand 
that can facilitate purification. These tag sequences can be of 
the following types: GST (glutathione-S-transferase), intein- 
CBD (chitin-binding domain) (sold by New England Biolabs, 25 
http://www.neb.com), MBD (maltose binding domain), 
polypeptides containing contiguous histidine residues that 
can facilitate purification of the polypeptide with which it is 
fused. The skilled person could design any other fusion pro- 
tein that could associate the function of the DSR-E of the 30 
invention with another function, a non limiting example being 
a sequence increasing the stability of the enzyme produced by 
expression in a recombinant host or a sequence that can 
increase the specificity or efficacy of action of said enzyme, or 
a sequence aimed at associating another connected enzymatic 35 
activity. 

Such fusion proteins also fall within the scope of the inven- 
tion provided that they contain the CD2 domain of the glucan 
binding site. In the same manner, fragments of SEQ ID No: 2, 
comprising at least SEQ ID No: 1 and the glucan binding 40 
domain, alone or integrated into a larger polypeptide forms 
part of the invention, as long as the enzymatic activity of the 
dextransucrase is conserved. 

Variations of the polypeptide sequences defined above also 
form part of the invention. In addition to the polypeptides 45 
obtained by conservative substitution of the amino acids 
defined above, the variations include polypeptides the enzy- 
matic activity of which is improved, for example by directed 
or random mutagenesis, by DNA shuffling, or by duplication 
of the CD2 catalytic domain. 50 

The particular structure of this enzyme identified in the 
present invention results from a process comprising: 

a) identifying and isolating dextransucrase from L 
mesenteroides catalyzing the production of GOSs of 
interest carrying a( 1 -*2) linkages; 55 

b) sequencing the enzyme fragments; 

c) synthesizing amplification primers that can amplify the 
gene corresponding to the producing strain or fragments 
thereof; 

d) sequencing the amplified fragments; 60 

e) cloning in specific vectors and their expression in appro- 
priate hosts. 

The features of the method employed are given in detail in 
the experimental section below. The first step consists of 
separating the proteins by polyacrylamide gel electrophoresis 65 
and identifying bands having a dextransucrase activity by an 
in situ enzymatic reaction in the presence of substrate and 



acceptor. The nature of the GOSs synthesized is then identi- 
fied for each band by HPLC analysis using the methods 
described in (1). The retention time for the oligosides in 
HPLC depends on the nature and organization of their osidic 
bonds. In particular, it is possible to distinguish between those 
constituted by residues having a(l->6) bonds, having 
a(l -»6) bonds with a a(l -»2) linkage at the nonreducing end 
of the molecule, and the desired compounds having a linear 
a(l~>6) chain with a(l->2) linkages. 

The inventors therefore isolated and identified dextransu- 
crase from L mesenteroides NRRL B-l 299 producing GOSs 
of interest. 

A reverse engineering process carried out in steps b) to e) 
above then provide the nucleotidie sequence encoding the 
enzyme, allowing industrial scale production and, if appro- 
priate, allowing it to be modified, improving its performance 
using techniques that are available to the skilled person. As an 
example, directed or random mutagenesis or DNA shuffling 
can be cited (3). 

The invention also pertains to an isolated nucleic acid 
molecule encoding an enzyme with glycosyltransferase 
activity that can form dextrans or oligosides having a(l —*2) 
linkages and comprising at least one sequence encoding a 
glucan binding domain, and at least one nucleotide sequence 
encoding a catalytic domain located on the 3* side of the 
foregoing, said sequence encoding a catalytic domain having 
at least 50% and preferably at least 70% similarity with SEQ 
ID No: 3. 

The term "similarity" means that for the same reading 
frame, a given triplet is translated by the same amino acid. 
Thus, this term includes modifications to bases resulting in 
degeneracy of the genetic code. 

The percentage similarity is determined by comparing a 
given sequence with the reference sequence. When they have 
different lengths, the percentage similarity is based on the 
percentage of nucleotides in the shortest sequence which are 
similar to those in the longest sequence. 

The degree of similarity can be conventionally determined 
using software such as ClustalW (Thompson et al, Nucleic 
Acid Research (1994), 22: 4673-4680) distributed by Julie 
Thompson (Thompson@EMBL-Heidelberg.de) and Toby 
Gibson (Gibson@EMBL-Heidelberg.de) at the European 
Molecular Biology Laboratory, Meyerhofstrasse 1, D-691 17, 
Heidelberg, Germany. ClustalW can also be downloaded 
from a number of websites including IGBMC (Institut de 
Genetique et de Biologie Moleculaire et Cellulaire, B P 163, 
67404 Illkirch Cedex, France; ftp://ftp-igbmc.u-strabg.fr/ 
pub/) and EBI (ftp://ftp.ebi/ac.uk/pub/software/) and all sites 
linking to the Bioinformatics Institute (Wellcome Trust 
Genome Campus, Hinxton, Cambridge, CB10 1SD, UK). 

The isolated nucleic acids of the invention can in particular 
comprise other sequences intended to improve the expression 
and/or activity of the enzyme produced. 

As an example, they can be: 

sequences encoding a signal sequence for their secretion; 
duplication of the sequence encoding the CD2 catalytic 
domain. 

Preferably, an isolated nucleic acid of the invention com- 
prises: 

a) two sequences encoding catalytic domains having at 
least 50%, preferably at least 80% similarity with SEQ 
ID No: 3; 

b) a sequence enclosing the glucan binding domain, the 
latter preferably being located between the two 
sequences in a). 
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A nucleic acid of the invention can also comprise: enzyme encoded by these homologous sequences has the 
a promoter suitable for its expression in a selected host cell; enzymatic activity of the polypeptides of the invention, 

a sequence encoding a signal peptide; and/or The nucleic acid sequences of the invention as described 

one or more variable sequences; above and qualified by derivatives of said molecules as 
said sequence or sequences all being located in the 5 1 por- 5 defined above are generally variations exerting the same bio- 

tion of sequences encoding the catalytic domain(s). logical function. Said variations can be natural variations, in 
Amore particular example of an isolated nucleic acid of the particular those observed from one species to another and 

invention comprises: resulting in interspecies variability or, in contrast, those intro- 

a) SEQ ID No: 4; duced via directed or random mutagenesis or by DNA shuf- 

b) a sequence having at least 80% similarity with SEQ ID 10 fli °fr 

N 0 . 4- or " Similarly, the invention encompasses isolated nucleic 

c) the complementary strand to sequence a) or b); or acid f enco ^8 a glycosyltransferase that can catalyze the 

d) a sequence hybridizing a), b) or c). s ^ thes ^ s °™extran or obgosacchande carrying at least 20% 
lie hybridization in d) is carried out under standard con- ^ff £ * ^ ™ ^ ***** 

ditions, and preferably under stringent conditions. The term 15 by D * A ^ com P ns ^ f 

"bybridizationundersW*^^ 3 f ° r T 

atleast80%sequenceiden%wimmesequencewhichisto defined above and m particular SEQ ID Nos: 3 and 4 and 

be hybridized, preferably an identity of at least 90% of the establishing the variations; 

sequence which is to be hybridized, under conditions which a stepper expressmg a host housing a variation from said 

ar?, for example, described in Sambrook and Russel (3^ 20 ™ f "! * * t 

edition, 2001, Cold Spring Harbor Laboratory Press, Cold a f * for ""T 1 * hosts expressing an enzyme that can 

SnrinaHarbor NY.) more preferably more than 30% 

\r. . ^* \ «• « a(l ^2) bonds on a suitable substrate and a step for 

The invention also concerns a gene encoding a dextransu- isolating the improved gene or genes 

erase that can form at least 15% a(l ->2) linkages. In addition A lA 5 i ■ -j al • ^ i 

totheencodingsequence,thegeaecomprise SS equence S that 25 ^wMw^^^n^cmtotU^ 

« • • 2 j ^ + , , ,, ^ , a) a sequence contaimng at least 80% similarity with the 

allow transcnption initiation and sequences that allow attach- ' n °, ^ 4 , 

* r oxta » *l -i_ mnD> Dminvi sequence encoding a dextransucrase expressed by the 

mentofmessengerRNAtothenbosome(RBS . SEQ ID No: plasmidpCR-T7-<krEin£ ^//deposited at the CNCM 

I "K /S a$ 18 *««««»- on ! 5 <* ^ 2001 ^ accessjon n ^be r L2649 (£ . co/j - 

^NRKLB-1299. „. , ATn 30 JM 109 [pCR-T7-dsrD]), or 

The nucleotides upstream of the translation mixtion ATG fc) , of me sequence m a) . 

are numbered l to 232^ The denommation of me strain transformed by the recom- 

The existence of a RBS sequence can be identified between binant , asmid pCR.^^ deposited at the CNCM is that 

nucleotides 218 and 223 as well as the consensus sequences indicated above in brackets. This does not affect the change to 

^ and -10 located between nucleotides 82 and 86 35 thedenominationofthegenecaniedoutfollowingdeposition 

(TTGAA)ontheonehandand lOOand 105 (ATAAAT) on the of said strajn for me leasoDS givena b 0 ve. 

o ernan . mven tion a] so concerns nucleic acid fragments as 

Any nucleic acid sequence that can be hybridized with defined above, which are hybridizable with SEQ ID No: 4 and 

DNA of SEQ ID No: 4orits complementary strand is capable can be used as hybridi2ation probes for detecting sequences 

of encoding an enzyme having the properties and character- ^ encoding the enzymes ofthe invention. Said fragments can be 

istics ofthe enzyme ofthe invention. This applies to natural pre pared using any technique known to the skilled person, 

sequences existing in microorganisms other than L In addition to hybridization probes, amplification primers 

mesentervides NRRL B-1299 and isolated from gene librar- also form part of the mventioi] , Said pjrimers ^ fragments 

les of micro-organisms, and to those prepared by genetic which are hybridizable with SEQ ID No: 4 or with its comple- 

engineering or by chemical synthesis. 45 mentary strand and which allow amplification of specific 

In particular, the sequences upstream of the translation sequences encoding dextransucrases present in a prokaryotic 

initiationATGandnecessaryforexpressionoftheproteincan oreukaryotic anim al or plant organism, 

advantageously be substituted by transcription initiation and/ The use of said amplification primers allows the use of a 

or ribosome binding sequences suitable for the expression method for identifying the possible existence of a gene encod- 

system selected for the coding sequence. 50 i ng an enzyme that can catalyze synthesis of GOS with 

A nucleic acid sequence that is capable of hybridizing a (l ->2) linkages in said organism, said method also forming 

under stringent conditions with the isolated nucleic acid of part ofthe invention. 

the invention also comprises fragments, derivatives or allele The invention also concerns expression vectors comprising 

variations ofthe nucleic acid sequence ofthe invention which a nucleic acid as described above under the control of a 

encodes a protein having the enzymatic activity described 55 sequence allowing its expression and preferably its excretion 

above. Thus, the fragments are defined as fragments of in prokaryotic or eukaryotic cells. The term "prokaryotic 

nucleic acid molecules that are sufficiently long to encode a cells" preferably denotes bacteria selected from a group com- 

protein that has conserved its enzymatic activity. This also prising E. co!i t Lactococcus, Bacillus and Leuconostoc. The 

encompasses fragments that arc free of the sequence encod- term "eukaryotic cells'* preferably means eukaryotes selected 

ing the signal peptide responsible for protein secretion. 60 from a group containing yeasts, fungi and plants. 

The term "derivative" means a sequence that is different The vector comprises a promoter suitable for expression of 
from the original sequence in one or more positions but which the isolated nucleic acid of the invention in the selected 
has a high degree of similarity with said sequences. In this expression system. As an example, the T7 bacteriophage pro- 
context, "similarity" means at least 80% identity of the nucle- moter could advantageously be selected for expression in E. 
otides, preferably at least 90% identity with the original 65 coll 

sequence. The modifications in this case are deletions, sub- The invention also concerns host cells, prokaryotic or 

stitutions, insertions or recombinations provided that the eukaryotic, transformed by a nucleic acid of the invention, 
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preferably comprised in an expression vector carrying a pro- a step for randomly modifying SEQ ID No: 4 and es tab- 
mo ter, adapted for expression in the selected host cells. The lishing a library of variations; 

transformed cells are selected from Gram - bacteria such as E. a step for expressing a host housing a variation from said 

coli, or from Gram* bacteria such as Lactococcus, Bacillus, modified sequences in a suitable host cell; 

Leuconostoc, or from eukaryotes in a group comprising 5 a step for screening hosts expressing an enzyme that can 

yeasts, fungi and plants. form more than 1 5% and preferably more than 30% of 

One particular example of a cell transformed in accordance a( 1 -*2) bonds on a suitable substrate; 

with the invention is the E. coli strain harboring a plasmid and a step for isolating the improved gene or genes, 

termed pCR-T7-dsrE carrying the SEQ ID No: 4 under the In a further implementation of the invention, the method 

control of the T7 bacteriophage promoter deposited at the 10 consists of modifying SEQ 3D No: 3 by duplicating all or part 

CNCM on 1 5 th Mar. 2001 under accession number 1-2649 . of the CD2 catalytic domain. 

The present invention also concerns a method for produc- It should be understood that the methods above are not only 

ing a glycosyltransferase that can form dextrans or oligosides aimed at obtaining a glycosyltransferase that can form oligo- 

having at least 15% and preferably at least 20% of type sides having a constant and reproducible percentage of 

a(l -*2) osidic linkages and compri sing: 1 5 a(l ->2) linkages of more than 1 5% of the total linkages, but 

a) inserting a nucleic acid or a vector as defined above into also to improve the degree of a(l -*2) linkages with the aim 
a host cell that can express and preferably secrete the of modifying the properties of the oligosides obtained to 
glycosyltransferase; improve their dietetic properties or their capacity to maintain 

b) characterizing the enzymatic activity being investigated or re-establ ish bacterial flora associated with certain organs of 
using any of the methods accessible to the skilled per- 20 the human or animal body. 

so n - Finally, the present invention concerns glycosyltrans- 

c) purifying the enzyme from a cell extract. ferases that can be obtained by a method as defined above and 
The term "method for characterizing enzymatic activity which ^ f °nn at least 1 5% and preferably at least 30% of 

known to the skilled person" means the methods described in a ( 1 ~* 2 ) °sidic linkages in glucooligosaccharides. 

the literature, for example in reference (2), andnovel methods 25 Finally, the invention pertains to the use of glycosyltrans- 

that may be developed to allow identification and mscrimi- ferases of ^ invention as well as those that can be obtained 

nation of glucooligosaccharides having the desired degree of bv methods mentioned above, in the production of a 

linkages. composition with a pre-biotic effect or in the manufacture of 

In feet, it concerns any screening method that can identify a dermatological, cosmetic or pharmaceutical composition, 

the presence of a(l -*2) linkages in a GOS. 30 Non-limiting examples that can be cited are the improve- 

Examples are- meilt m mtestma l transit in animals and in man, an improve- 

HPLC in which GOS migration varies as a function of the ment ' m ^ }cinm f ^ magnesium assimilation and of min- 

nature and position of the linkages, in particular those erals m S eneral > preventing cancer of the colon and 

containing the a(l-*2) bond at the reducing end and ^ ntl ^ r ** atment of skin Actions such as acne, dan- 

those containing this bond on the penultimate glucose; 35 or hody odor. 

and/or e ^^n^S 6 °f me polypeptides and nucleic acids 

nuclear magnetic resonance (NMR); encodin S said Peptides of the invention is not only in 

the existence of a positive reaction with specific mono- ™P™™ »*™» V*. t^cmihty and 

clonal antibodies of a(l -2) bonds on the reducing end „ co * of^ly^syltransferases that can form ohgosacchandes 

and/orspecmcmonoclonal4tibomesofa(l-2)bonds 40 ^^<^^)^^^^^V^^^ 

4t _ n- * i r t . n^o enzymes the functionality of which is improved, 

on the penultimate glucose oi the GOS. t-u ^ i 1.1 

. \. . & , - . . . . , The figures, examples and detailed description below pro- 

Reinvention also concerns a .method for obtatmng a gly- videnon . Kmit ^ 

cosyltrar^ferase t^t can have olivines or dextmns having a iUnctionalities of polypeptides with enzymatic activity 

percentage ofa(l-2) linkages of more .than IJ/oandpref- 45 ^ encoding mem. In particular, they can illus- 

erably more than 30% c* he totally of Aeosidac bonds and ^ ^ the specificity of the catalytic domain 

comprising a step for modifying SEQ ID No: 4 by addition, t fa ^ e carbt / xylic ^ J the e £ te ^ 

deletion or mutation provided that: ^ ^ ^ ^ evolution to obtain improved enzymes, 
me reading frame is not modified; and 

the following amino acids are conserved after translation: 5Q KEY TO FIGURES 
W in positions 425 or 2 1 22, encoded by the TGG triplet in 

positions 1273 and 6364; FIG. 1: Structure of native glycosyltransferases and 

E in positions 430, 565, 2 1 27 and 2248, encoded by GAA derived recombinant proteins : FIG. la) shows the structure of 

triplets in positions 1 288, 1 693, 6379 and 6742 respec- glycosyltransferases and dextransucrases described in the lit- 

tively; 55 erature (1). PS: signal peptide; ZV: variable zone; CD: cata- 

D in positions 487, 489, 527, 638, 2170 and 2210, encoded lytic domain; GBD: glucan binding domain. FIG. lb) shows 

by GAT triplets in positions 1459, 1465, 1579, 1912, the structure of the glycosyltransferase of the invention. 

6508 and 6628 respectively; FIGS. 1c) to It) show different constructions comprising 

D in positions 2172 and 2322 encoded by GAT triplets in deletions in comparison with native DSR-E protein. A(PS) 

positions 6514 and 6964; 60 corresponds to the control constituted by the entire form 

H in position 637 and 2321, respectively encoded by the cloned into the pBAD-TOPO thiofusion system (lnvitrogen). 

CAT triplet in position 1 909 and CAC in position 696 1 ; FIG. 2: Diagrammatic summary of the method for cloning 

Q in positions 1019 and 2694, respectively encoded by the nucleotide sequence encoding a glycosyltransferase of the 

triplets CAA (position 3055) and CAG (position 8080). invention using a genome library by using a PCR probe 

A method for producing a glycosyltransferase according to 65 described in Table 1 and a Hindlll/EcoRV probe respectively, 

the invention having the same characteristics as above can FIG. 3: Comparison of the signal sequences of different 

also comprise: glycosyltransferases of L mesenteroides (residues 1-40 of 
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SEQ ID NO: 2). The conserved amino acids are shown in ug/ml of ampicillin, 0.5 mM of isopropyl-|5-D-thiogalacto- 

bold. DSR-B: L mesenteroides NRRLB-1299 (4) (SEQ ID pyranoside (IPTG) and 40 ug/ml of 5-bromo-4-chloro-3-in- 

NO: 45); DSR-S: L. mesenteroides NRRLB-512F(5) (SEQ dolyI-|3-D-galactopyranoside (X-gal). E. coli TOP 10 cells 

ID NO: 46); ASR: L mesenteroides NRRL B-1355 (6) (SEQ were used to clone the PCR TOPO Cloning (Invitrogen) 

ID NO: 47). 5 product and cultivated on LB medium supplemented with 

FIG. 4: Alignment of 1 1 repeat sequences (SEQ ID NOS: kanamycin in a concentration of 50 ug/ml. 

50-61) of the DSR-E enzyme and observed in the variable Regarding expression of dsrE, the ECHO Cloning System 

zone. cloning kit (Invitrogen) allows a PCR product to be cloned in 

FIG. 5: Alignment of conserved sequences in the catalytic a donor vector (pUNI/V5-His-TOPO), preceding a step for 

domain (SEQ ID NOS: 6-17 and 62-103 to recombination with a suitable acceptor vector (pCR-T7-E). 

Block A: essential amino acids of the N-terminal portion of This system requires E. coli PYR1 , TOP 1 0 and PL2 1 (DE3) 

the catalytic domain; pLysS cells cultivated on LB medium supplemented with 50 

Block B: amino acids of the catalytic saccharose binding u.g/ml of kanamycin as well as 34 u.g/ml of chloramphenicol 

domain; fortheBL21(DE3)pLysS strain. 
Blocks C, D, E: blocks containing three amino acid resi- 15 Digested and dephosphorylated pUC18 plasmids from 
dues involved in the catalytic triad (6); Pharmacia (Amersham Pharmacia Biotech) were used to con- 
Block F: sequence containing glutamine 937 of GTF-1 stitute the genomic DNA library of jL mesenteroides NRRL 
studied by Monchois et al (7). B-l 299. Cloning of the PCR product necessitated the use of 
The entirely conserved amino acids are indicated in bold. the pGEM-T Easy plasmid (Promega) and TOPO-XL plas- 
"*": conservative substitutions; semi -conservative sub- 20 mid (Invitrogen) for fragments of more than 2 kbp. 
stitutions; - - - : gap. The numbering is that for SEQ ID No: 2. The pBAD-TOPO Thiofusion system (Invitrogen) used to 
FIG. 6: HPLC characterization of products synthesized by construct the different deleted forms of the DSR-E protein 
recombinant enzyme DSR-E. used the araBAD promoter the control mechanisms for which 
6A: HPLC analysis of glucooligosaccharides obtained involve the AraC regulatory protein. In the absence of an 
with dextransucrases of L. mesenteroides NRRL B-l 299. 25 inducer, namely L-arabinose, the dimeric AraC protein asso- 
6B: HPLC analysis of glucooligosaccharides obtained by ciates with the regulatory structures of the operon and 
recombinant DSR-E. The following peaks are identified: entrains the formation of a DNA loop, said loop then blocking 
1 : fructose transcription of genes placed under the control of the araBAD 
2: maltose; promoter. In the presence of L-arabinose, in contrast, AraC 
3: sucrose; 30 forms a complex which liberates the DNA loop and allows 
4: panose; transcription initiation. The base expression can be limited by 
5: R4; adding glucose to the culture medium: this reduces the level 
6: OD4; of cyclic AMP and thus concomitant activation of the CAP 
1- R5; protein (cAMP activator protein). The level of activation 
8: OD5; 35 obtained is a function of the concentration of L-arabinose so 
A, B, C: unidentified peaks. that the optimum conditions for production of the protein of 
6C: recombinant DSR-E deleted from the catalytic domain interest can be selected with accuracy, 
of the carboxylic portion of the enzyme (ADSR-E). Further, the use of this vector can allow a 12 kDa thiore- 
FIG. 7: HPLC analysis of acceptor on maltose reaction doxin tag to be positioned on the N-terminal end of the protein 
products synthesized by different entire forms and deleted 40 0 f interest. This fusion encourages the translation of the gene 
from the DSR-E protein. encoding said protein of interest. The tag protein can also 
L.m. B-1299: mixture of dextransucrases produced by L. enhance the solubility of the protein to which it is fused. The 
mesenteroides NRRL B-1299. pBAD-TOPO Thiofusion system is designed to allow ready 
The peaks were identified as follows: elimination of the thioredoxin tag by simple cleavage using 
F: fructose; 45 enterokinase. Finally, using this expression system, a histi- 
M: maltose; dine tag is inserted on the C-terminal end side of the protein 
S: saccharose of interest. Such a tag is used to purify said protein by affinity. 
P: panose; Within the context of using this system, the E. coli TOP 1 0 
R4, R5 : GOS comprising a(l -»2) bonds; strain was cultivated on LB medium supplemented with 1 00 
OD4, OD5: GOS free of a(l— 2) bonds. 50 u.g/ml of ampicillin. 

MATERIALS AND METHODS ^ Gel Electrophoresis, Location and Characterization of 

" * ' " " ~ ' " *~ Enzyme: 

After culturing L mesenteroides NRRL B-1299 for 7 h, the 

1) Bacterial Strains, Plasmids and Growth Conditions: 55 me dium was centrifuged (7000 rpm, 4° C, 30 min) and the 

All strains were kept at -80° C in tubes containing 1 5% cells, in which 90% of the enzymatic activity was found, were 

glycerol (v/v). concentrated 10 times in an acetate buffer solution (20 mM, 

Leuconostoc mesenteroides B-1299 (NRRL. Peoria, USA) pH 5.4), heated for 5 minutes at 95° C. in the presence of 

was cultivated at 27° C. with stirring (200 rpm) on standard denaturing solution (tris HC1 62.5 mM, SDS 4%, urea 6M, 

medium (saccharose 40 g/1, potassium phosphate 20 g/1, yeast 60 bromophenol blue 0.01 % and p-mercaptoethanol 200 mM). 

extract 20 g/1, MgS0 4 ,7H 2 0 0.2 g/1, MnS0 4 ,H 2 0 0.01 g/1, 300 ul of the mixture was deposited on 7% polyacrylamide 

NaCl 0.01 g/1, CaCl 3 0.02 g/1, FeS0 4 ,7H 2 0 0.01 g/1), the pH gel. After migration, the total proteins were revealed by 

being adjusted to 6.9. amido black staining, while the dextransucrase activity was 

Escherichia coli DH5a and JM1 09 were cultivated on LB detected by staining with SchifFs reagent polymer after syn- 

medium (Luria-Bertani). 65 thesizing the dextran in situ. The bands corresponding to the 

Selection of pUC 1 8 or pGEM-T Easy recombinant clones active dextransucrases were excised and incubated separately 

was carried out on LB-agar dishes supplemented with 100 in 2 ml of 20 mM sodium acetate solution, pH 5.4, containing 
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100 g/1 of saccharose and 50 g/1 of maltose. After total con- 
sumption of saccharose, the reaction was stopped by heating 
to 95° C. for 5 minutes, and the reaction medium was centri- 
fuged for 5 minutes at 15000 g to eliminate the insoluble 
dextran. The samples were analyzed by reverse phase chro- 5 
matography (CI 8 column, Ultra sep 100, 6 um, 5x300 mm, 
Bishoff Chromatography) using ultrapure water as the eluent, 
at a constant flow rate of 0.5 ml/min. The oligosaccharides 
were separated for 30 minutes at ambient temperature and 
detected by refractometry. Peptide sequencing was carried 10 
out on the selected protein bands by the Laboratoire de 
Microsequencage, Institut Pasteur, Paris. 

3) Molecular Biological Techniques Used 

Purification of the E. coli plasmid and purification of the 
genomic DNA of L. mesenteroides was carried out using the 
QiaPrep Spin Plasmid kit and the Cell Culture DNA maxi kit 
(Qiagen) respectively. The amplification and cloning meth- 
ods were carried out using standard techniques (Sambrook 
and Russel, 2001, supra). Restriction and modification 
enzymes from New England Biolabs or Gibco BRL were 
used in accordance with the manufacturer's instructions. 

PCR was carried out with primers selected on the basis of 
the protein sequence obtained on an isolated band from gel 
electrophoresis (see supra, gel electrophoresis and enzyme 
localization). Two peptides were selected: 

29-FYFESGK (SEQ ID NO: 18); and 

24-FESQNNNP (SEQ ID NO: 19) 

and used to synthesis degenerate oligonucleotides indicated 
in Table 1 below. 30 

In this table, the numbering of which follows that of SEQ 
ID No : 4, it appears that the presence of a serine residue in the 
two peptides necessitates the synthesis of two primers for 
each peptide since serine can be encoded by six different 
codons. ECHO-dir and ECHO-inv are primers which allowed 35 
amplification of dsrE by PCR for cloning into the ECHO 
Cloning (Invitrogen) expression system. 



20 



25 



PCR 

PCR was carried out using a Perkin-Elmer thermocycler, 
model 2400, with 50 nanograms of genomic DNA. The quan- 
tities of primers used was 1 0 uM of 29-Dir- 1 and of 24-Inv 1 . 
250 uM of each triphosphate deoxynucleotide and Taq poly- 
merase were added to the reaction mixture. 

After amplification of 25 cycles at 94° C. for 30 seconds 
then at 50° C. for 30 seconds, then at 72° C. for 5 minutes, a 
666 base pair fragment was obtained. 

Certain fragments were amplified using the "Expand Long 
Template PCR" (Roche Boehringer Mannheim) system, in 
accordance with tie manufacturer* s instructions. This system 
can amplify large fragments of up to about 20 kbp highly 
effectively. The combination of two DNA polymerases can 
minimize errors during the elongation phases. 

Southern Hybridization and Gene Library of L mesenteroi- 
des NRRL B-1299 

Chromosomal DNA from L mesenteroides NRRL B- 1 299 
was digested with different restriction enzymes then sepa- 
rated by electrophoresis on 0.8% agarose gel in TAE 0.5x 
buffer. 

Genomic libraries of the bacteria were transferred onto 
nylon hybond N+ membranes (Amersham PnarmiciaBio- 
tech). Hybridization was carried out using the 666 base pair 
fragment of deoxy-adenosine-triphosphate labeled with 32 P. 
The labeling reaction was carried out using the "Mega Prime 
DNA Labelling System Kit" (Amersham PharmaciaBiotech) 
labeling kit, followed by purification of the probe on 
MicroSpin S-200HR columns. Pre-hybridization and hybrid- 
ization was carried out under highly stringent conditions (65° 
C. overnight using the normal methods) (Sambrook and Rus- 
sel, 2001 , supra). 

Reverse PCR 

The reverse PCR reaction produces a linear DNA fragment 
from a circular matrix using divergent primers. 



TABLE 1 



SEQ ID MOS: 13-27 



Designation Description Sequence 5* -3' 



29-dirl 
29-dir2 
24-invl 
24-inv2 
I PCR -rev 



FESQNNNP 



sequence nt 
5769-5798 



TT (C/T) TA (C/TJ TT (C/T)GA(A/G) TCAGG (C/G) AA ( A/G) 
TT (C/T) TA (C/T) TT (C/T)GA(A/G) AGCGG(C/G) AA(A/G) 
(T/G) GG (G/A) TT (G/A) TT (G/A) TTTTGTGA(T/C) TCAAA 
{ T/G } GG (G/A) TT (G/A) TT (G/A) TTTTGGCT(T/C) TCAAA 
CCCTTTACAAGCTGATTTTGCTTATCTGCG 



IPCR-dir 



sequence nt 
8311-8342 



GGGTCAAATCCTTACTATACATTGTCACACGG 



sequence nt 
6-39 



- AGTTGTATGAGAGACATGAGGGTAATTTGTGACCGTAAAAAATTG 



sequence nt 
8457-8504 



AT TTGAGGT AATG TTGATT TAT CACCAT - 
CAAGCTTGAAATATTGACC 
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Genomic DNA from L mesenteroides NRRL B-1299 was 
digested with EcoRV under the conditions recommended by 
the manufacturer. 

After re-circularization, the digestion products were used 
as a matrix in a reverse PCR reaction [Extrapol II DNA 
polymerase (Eurobio), reaction volume of 50 ul, reverse PCR 
reaction parameters: 25 cycles; 94° C; 30 seconds; 51 ° C, 30 
seconds; 72° C, 3 minutes]. The two primers were selected as 
a function of the pSB2 insert sequence as indicated in FIG. 2. 

FIG. 2 summarizes the conditions for obtaining different 
plasmids carrying dsrE fragments by screening the gene 
library and using the probes described above. 

DNA Sequence and Analysis 

After sequencing the peptides, degenerate primers marked 
out because of the frequency of use of codons in the dextran- 
sucrase genes of L mesenteroides NRRL B-1299 were syn- 
thesized and allowed amplification of a 666 bp fragment. 
Sequencing this fragment revealed strong homologies with 
the genes of known dextransu erases, even though it was 
entirely novel. 

The use of this fragment as a homologous probe in South- 
ern experiments allowed positive signals on different tracks 
of genomic DNA to be marked. A first Hindlll library was 
then screened and a recombinant plasmid termed pSB2 con- 
taining a 5.6 kbp insert was purified. An analysis of the 
sequence for this Hindlll fragment revealed an open reading 
frame covering the whole insert. Then a EcoRV library was 
screened with a Hindlll/EcoRV probe isolated at the N-ter- 
minal end of the 5.6 kbp Hindlll insert. A recombinant pSB3 
recombinant plasmid, tested positively by dot-blot, was 
shown to contain a 3.8 kbp insert which, after sequencing, 
was shown to contain the initiation codon for translation and 
the promoter region of the novel dextransucrase gene termed 
dsrE. 

With the aim of obtaining the dsrE terrnination codon, 
reverse PCR was carried out on genomic DNA from L. 
mesenteroides NRRL B-1299 digested with EcoRV and re- 
ligated to itself, using divergent oligonucleotide primers des- 
ignated from the pSB2 insert sequence. A single fragment 
with the expected size of 1 kbp was amplified then cloned in 
pGEM-T Easy to obtain the pSB4 plasmid. After sequencing, 
the amplified sequence located downstream of the Hindlll 
site comprised 221 bp and contained the reading frame ter- 
mination codon for dsrE located 30 bp downstream of the 
Hindlll restriction site. 

Sequencing of the different fragments carried by the three 
plasmids was carried out on both strands by the company 
Genome Express. Sequence analyses of the nucleotides was 
carried out using "ORF Finder" (http://www.ncbi.nlm.nih- 
.gov/gorf7gorf.html), Blast (http://www.ncbi. nlm.nih.gov/ 
blast/b]ast.cqi, Altschul et al, 1997), ClustalW (http:// 
www2 .ebi.ac.uk/clustalw, Thompson etal, 1994), PRODOM 
(http://protein/tolulouse.inra.fr/prodom.html, Corpet et al, 
2000), PFAM (http^/pfam.wustl.edu.hmmsearch.shtml, 
Bateman et al, 2000) and SAPS (http://bioweb.pasteur.fr/se- 
gana/interfaces/saps.html, Brendel et al, 1992), all of this 
software being available on the Internet. 

Protein Expression 

Two cloning and expression systems were used to produce 
recombinant proteins in E. coli. namely the ECHO-CIoning 
and pBAD-TOPO Thiofusion (Invitrogen) systems. 

By way of example, the method for cloning the nucleotide 
sequence encoding the DSR-E protein using the ECHO- 
Cloning system will now be briefly described. 

Two primers as proposed in Table I above were used for 
amplification using the "Expand Long Template" system 
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under the following conditions: 94° C. for 3 minutes, fol- 
lowed by 25 cycles at 94° C. for 30 seconds, 55° C. for 30 
seconds, and 68° C. for 7 minutes. The PCR products were 
then cloned into the pUNl/V5-His-TOPO vector to obtain a 
5 donor vector (pUNI-dsrE) to be recombined with an acceptor 
vector (pCR-T7-E) and adapted for expression in E. coli. The 
final plasmid was designated pCR-T7-dsrE. 

This construction, placing the dsrE gene under the control 
of the bacteriophage promoter T7, allowed inducible expres- 
10 sionof the dsrE gene. 

After induction with 1 mM of IPTG, the transformed E. 
coli BL21 cells were harvested by centrifuging after 4 hours 
growth and re-suspending at a final optical density of 80 at 
600 nm in a 20 mM sodium acetate buffer, pH 5.4, and 1% 
15 Triton XI 00 (v/v) in the presence of 1 mM of PMSF to 
prevent proteolysis in the cell extracts after sonication. 

Similar experiments carried out with the pBAD-TOPO 
Thiofusion system allowed the recombinant vector pBAD- 
TOPO-dsrE to be constructed. 

20 Enzymatic Tests 

Hie enzymatic reactions were carried out under standard 
conditions at 30° C. in a 20 mM sodium acetate buffer, pH 5.4, 
NaN 3 1 g/1 and saccharose, 1 00 g/1. The activity of the DSR-E 

25 enzyme was determined by measuring the rate at which the 
reducing sugars were liberated, represented here by fructose, 
using the dinitrosalicylic acid method which is well known to 
the skilled person. One unit is defined as the quantity of 
enzyme which would catalyze the formation of 1 urnol of 

3Q fructose per minute under standard conditions. The oligosac- 
charides were synthesized in a reaction medium containing 
100 g/1 of maltose, 200 g/1 of saccharose and 0.5 units/ml of 
DSR-E. 

As for the dextran synthesis, the enzymatic reaction was 
35 continued for 24 hours in the presence of 100 g/1 of glucose. 
The dextran produced was precipitated in the presence of 
50% (v/v) ethanol and washed twice in 50% ethanol (v/v) 
prior to freeze drying. It was then dissolved in an amount of 
1 0 mg/ml in D 2 0 and analyzed by 13 C NMR spectrometry. 

40 HPLC Separation 

100 ul samples were removed and heated at 95° C. for 5 
minutes then diluted in ultrapure water to obtain a final con- 
centration of total sugars of less than 5 g/1. After centrifuging, 
the residual substrates and the different species formed were 

45 analyzed by HPLC on a CI 8 column (Ultrasep 100, 6 urn, 
5x300 mm, Bishoff Chromatography). 

The oligosides were separated at ambient temperature for 
30 minutes in ultrapure water used as the ehient, at a flow rate 
of 0.5 ml/min. Detection was accompanied by refractometry. 

50 These conditions allowed the following species to be sepa- 
rated: fructose, maltose, leucrose, saccharose, and oligosides 
with a degree of polymerization that did not exceed 6. 

Calculation of Yields 
55 The method for calculating the yields for the oligoside 
synthesis reactions took into account the residual concentra- 
tion of the acceptor in accordance withthe following formula: 

R={[GOS JinalJ-fmirial GOS/}/{0.474x(sacchraosc 
cons umed]+ (acceptor consumed]} 

60 in which R represents the real yield of the total GOS syn- 
thesis reaction, the concentrations being expressed in g/1. 

Construction of Different Deleted Forms of DSR-E Protein 
The different deleted forms of the DSR-E protein [FIG. 1c) 
65 to If)] were obtained by PCR amplification of fragments 
corresponding to the dsrE gene then cloning in the pBAD- 
TOPO Thiofusion vector described above. The primers used 
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for amplification of the regions selected from the dsrE gene HPLC, 2 peptides: 29-FYFESGK (SEQ ID NO: 18) and 

are shown in Table II below. The positions of the primers are 24-FESQNNNP (SEQ ID NO: 1 9), were sequenced and used 

shown with respect to SEQ ID No: 5, relating to the sequence as a model for the synthesi s of degenerate nucleotide primers, 

for the dsrE gene. The bases mutated to introduce the Ncol The different amplification and cloning steps are shown in 

restriction site are shown in bold and the resulting Ncol site is 5 FIG. 2. The complete gene was inserted into the pCR-T7-E 

underlined. plasmid and expressed in E. coli. 

The production of a functional enzyme was attested by the 

TABLE 2 production of GOSs the HPLC analysis of which is shown in 

FIG. 6b). 

SEQ ID MOS: 28-34 1Q ^ ^ q{ $ ^ ^ ^^^g QQgj ^fa a 

Deoig- a(l -*2) linkage, should in particular be noted. 

nation Posit iona Sequence 5 ' - 3 ' 

p BAD -PS/ 344-373 GCCATGGCAAATACGATTGCACTTGACACG 
ZV-dir 

pBAD-ZV/ 971-1001 GCCATGGA CGGTAAAACCTATTTTCTTGACG 

0151 dir 2.1 Nucleotide Sequence 

pBAD-cDi/ 3656-3682 TCCATGGGTGAAAAAACAAGCACCGGc The nucleotide sequence of the enzyme is shown in SEQ 

GBD-dir ^ jrj) jsj 0: 4 j t i s composed of a reading frame of 8506 nucle- 

pBAD-GBD/ 6167-6189 ACCATGGA TATG TCTACTAATGC OtideS. 

CD2-dir The nucleotide sequence for insertion into the pCR-T7- 

dsrE plasmid contained a ribosome binding site (RBS), 9 

pbad-cdi/ 3638-3658 taact gt ttagg CAAGAATCC bases upstream of the ATG initiation codon and was com- 

GBD- inv r 



EXAMPLE 2 
Characterization of dsrE and DSR-E Sequences 



25 



posed of a hexanucleotide GAGGAA. 



pbad-gbd/ 6 14 6 - 6172 TAATGTATTAGTGAATAAGTATTCAcc 2.2 Analysis of Amino Acid Sequence 
CD2 - inv gjQ^ Qudeotide ds r E sequence encodes a 2835 amino 
pBAD-ent- 8714 - 8737 AATTTGAGGTAATGTTGATTTAT c acid protein shown in SEQ ID No: 2. The isoelectric point for 
inv this protein is 4.88 and its theoretical molecular weight is 
30 313.2 kDa. Despite strong similarities with known dextran- 

, j « j * sucrases, DSR-E is characterized by an original structure. 

The above direct and reverse primers were designed to . J .~\ 

ensure translational fusion of the N-terminal thioredoxin tag Alignment of the amino and sequence with known glyco- 

and the C-terminal polyhistidine tag of the truncated protein Vi^Osmes and dextransucrases confirmed that the struc- 
forms, satisfying the open reading frames for the regions m me glycosyltransfenise domain and dextransucrases 

encoding said forms 35 domain was conserved, namely: a signal sequence, a variable 

lft , „ . _ taha *t*i» * ^ • i • « * ♦ zone, a highly conserved catalytic domain and a glucan bind- 

IfthepBAD-TOPOThiofusion p asm.dconfcnn a specific . domal ^ is structure is 4 0wn m FIG . la) 8 
restriction site for the Ncol enzyme located at the 5' end of the * . „_ x , t . ' . _ 

region encoding thioredoxin, a second Ncol site is introduced u As mdicated m FI p' lb )> a sec ° nd catalvtlc dom ? in 

into each direct primer to enable extraction of that region if ^^7"^^ p0r ?° nof as confirmed bv 

required 40 PRODOM and Blast analysis. 

' TheP(^amplificationreactioiiswerecarriedoutusingthe With a molecular weight of 313.2 kDa, DSR-E had about 

"Expand Long Template" system under the following condi- ^ ice the mean molecular weight of other glycosyltrans- 

tions: pre-denaturing at 94° C. for 3 minutes followed by 25 ferases *** ^transucrases ( ), which is in agreement with 

cyclesat94°C.for30seconds,52°C. for 30 seconds and 68° the Presence of a second catalytic domam at the c-terminal 
C for 7 minutes 0 w a * on 8 er giucan binding region. 

The amplification products generated were then cloned a) Analysis of Signal Sequence: 

into the pBAD-TOPO Thiofusion vector for subsequent ^ Si &^ sequence and the nucleotide 'sequence encoding 

transformation of the E, coli TOP 10 strain. Recombinant me P"^ "Z™ 1 were ■"gMy conserved if compared with 

clones were selected, their restriction profile analyzed to ~ other dextransucrases, as shown in FIG. 3. The cleavage site 

identify a recombinant plasmid carrying the insertion orien- * ^ located between ammo acids 40 and 41. 
tated as expected for each of the investigated forms. b ) Variable Domam: 

Downstream of the signal peptide, DSR-E had a 207 amino 

EXAMPLE 1 ac *d variable domain. When it was compared with other vari- 

55 able glycosyltransferase domains, using a SAPS type align- 

Characterization and Purification of the DSR-E mentprogram, the presence of a 14 amino acid motif repeated 

Enzyme and Obtaining the dsrE Gene 1 1 times was revealed, as indicated in FIG. 4. 

This alanine-, threonine- and aspartic acid-rich repeat 

The enzymes produced by L mesenteroides cultures and motif has never before been identified, 
obtained on a polyacrylamide gel in SDS as described in the 60 The role md significance of this region has never been 

Materials and Methods section were isolated by cutting the elucidated. Different studies have shown that its deletion does 

g e ] . not affect enzymatic activity (4). The role of the 1 4 amino acid 

The GOSs produced by the isolated enzymes were ana- "V* 1 motif > which does not » °& er glycosyltrans- 

lyzed by HPLC using the methods described in (1). The ferases, remains to be determined, however, 
enzyme the activity of which was being investigated was 65 c) Analysis of Catalytic Domains: 
deduced from the nature of the GOSs produced. After trypsic The first catalytic domain extends from amino acids 248 to 

proteolysis and separation of the peptides produced by 1142 (CD1) of SEQ ID No: 2, while the second is located 
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between amino acids 1980 and 2836 (CD2). These two 
domains have 45% identity and 65% similarity between 
them. 

CD1 and CD2 contain amino acids already identified in 
glycosyltransferases and dextransucrases as being essential 
to their enzymatic activity, as shown in FIG. 5. 

The catalytic triads of CD1 and CD2 determined by anal- 
ogy with a amylase (7) are present in the following positions : 

(Asp 527/Glu 565/Asp 638 for CD1 and Asp 2210/Glu 
2248/Asp 2322 for CD2). 

Other conserved residues were identified as being impor- 
tant for enzymatic activity: the residues Trp 425/Glu 430 for 
CD1 and Trp 2122/Glu 21 27 for CD2, which are analogous to 
those of the N-terminal domain of GFT1 described by Mon- 
chois et al (4): Trp 344/Ghi 349. 

In contrast, certain sequences located in the conserved 
region of the glycosyltransferases and dextransucrases are 
not found in the CD2 of DSR-E. Thus, as indicated in FIG. 5 
below, the sequences FIHNDT (SEQ ID NO: 35) (2214- 
2220) and KGVQEKV (SEQ ID NO: 36) (2323-2329) 
diverge from other consensus sequences of dextransucrases 
already studied, which are respectively NVDADLL (SEQ ID 
NO: 37) and SEVQTV1 (SEQ ID NO: 38). 

d) Glucan Binding Domain: 

When the DSR-E sequence is compared with known 
sequences, it appears that the glucan binding region is sub- 
stantially longer. In fact, the length of this domain is about 
500 amino acids in the glycosyltransferases and dextransu- 
crases being studied while in DSR-E, it represents 836 amino 
acids. Several A and C repeat motifs, in particular a series of 
AC repetitions, have been identified. Table III below shows 
the consensus sequences of the repeat motifs of GBD, in 
particular the A and C motifs, described in the literature 
relating to dextransucrases of Leucononstoc and Streptococ- 
cus spp. 



Identification and Characterization of Enzymatic Activity 
Using a glucose acceptor molecule, the dextransucrases 
produced by recombinant E. coli were compared with those 
produced by L. mesenteroides NRRL B-1299. 
5 HPLC analysis of the reaction products with recombinant 
DSR-E (FIG. 6) snowed retention times corresponding to the 
previously identified GOSs R4 and R5 (2). Type R oligosac- 
charides are linear GOS series, the a(l -*2) bond being linked 
to the non-reducing end. The OD series, linear GOSs result- 
10 ing from glycoside a(l -»6) bonds with a maltose residue at 
the reducing end was observed in very small quantities. Three 
novel compounds, in contrast, were detected in the recombi- 
nant enzyme products. 
Identification of GOSs Produced: 
1 5 Finally, FIG. 6b clearly shows that peaks 5 and 7 represent- 
ing the GOSs of the R series are relatively larger with the 
recombinant enzyme than with the native enzyme in which 
the peaks corresponding to panose and OD5 are larger. 

2Q EXAMPLE 4 

Effect of Deletion of CD2 on the Enzymatic Activity 
of DSR-E 



The genomic DNA of L mesenteroides NRRL B-l 299 was 
used as a matrix to amplify the dsrE gene by PCR deleted 
from the sequence corresponding to the second catalytic 
domain. To this end, 2 oligonucleotides, ECHO-dir (5'-AGT- 
TGTATGAGAGACATGAGGGTAATTTGT- 
GACCGTAAAAAATTG) (SEQ ID NO: 48) corresponding 
to the nucleotide sequence -6 to 39 and containing the trans- 
lation initiation codon, and ECHO-inv-del (5'-GTAITAGT- 
GAATAAGTATTCACCATTGCATT- 
TATCGTCAAAAEAGTACG) (SEQ ID NO: 49) 
complementary to the sequence 5889-5937 and correspond- 
ing to the peptide sequence YYFDDKGNGEYCFTNT (SEQ 



30 



TABLE 3 



Motif 



SEQ ID HQS: 39-43 
Consensus sequence 



A WWYPNxDGQAATGLQTIDGQTVPDDNGxQVKG 

B VNGKTYYFGSDGTAQTQANPKGQTFKDGSGVLRFYNLEGQYVSGSGWY 

C DGKIYFFDPDSGEWKURFV 

D GGWKNADGTYSKY 

N YYFxAxQGxxxL 



x: any amino acid 



EXAMPLE 3 
Expression of dsrE in E. coli 

E. coli BL21 (DE3) pLysS pCR-T7-dsrE cells were culti- 
vated as described above. After polyacrylamide gel electro- 
phoresis (page-SDS), analysis of the protein extracts effec- 
tively revealed the presence of several bands having 
saccharase dextran activity, said activity being measured as 
described above. 

The E. coli JM109 [pCR-T7-dsrD] line was deposited at 
the CNCM on 15 th Mar. 2001 with accession number 1-2649. 



55 ID NO: 44), were synthesized, to fuse the C-terminal end of 
the deleted protein with a His tag present on the cloning 
vector. The PCR reaction was carried out using a DNA ther- 
mal cycler model 2400 (Perkin Elmer) with the Expand Long 
Template System (Boehringer Mannheim) using the follow- 

60 ing temperature cycle: 94° C. for 3 min, then 25 cycles with: 
30 s at 94° C, 30 s at 55° C. and 7 min at 68° C. The PCR 
product was then cloned into the pUNI donor vector and the 
resulting plasmid was used in a recombination reaction with 
the pCR-T7-AdsrE expression vector. 

65 The cell extract, preparation, enzymatic reaction and reac- 
tion product analysis were those described in Example 3 
above. 
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The HPLC profile of the GOSs obtained with the DSR-E 
enzyme deleted from the CD2 domain appear in FIG. 6c). 

The type R GOS shown as peaks 5 and 7 shown in FIGS. 
6a) and 6b) are entirely absent from the products obtained 
with the recombinant enzyme deleted from CD2. The only 5 
analyzable products were those corresponding to linear oli- 
gosides resulting from a(l-*6) bonds with a maltose residue 
in the reducing portion. This result clearly indicates the essen- 
tial role of the catalytic domain located in the carboxy-tenni- 
nal portion of the enzyme in its capacity to form a(l->2) 10 
osidic bonds. 

EXAMPLE 5 

Study of Structure-function Relationships of DSR-E 15 
Protein 

The dsrE gene, insofar as it is the first gene encoding a 
dextransucrase catalyzing the synthesis of a(l-»2) bonds to 
have been cloned, is of particular interest. Thus, it is impor- 20 
tant to characterize this gene and its expression product, in 
this case by determining the roles of the different domains 
making up the DSR-E protein in the function which has been 
assigned thereto, namely to correspond to a ct(l -»2 specific to 
the synthesis of a(l -*2) bonds. 25 

5.1 Deleted forms of DSR-E Protein: 

A study of six different forms obtained by deletion of one 
or more domains from the DSR-E protein was envisaged in 
order to determine the following by reference to FIG. 1 
below: (i) the influence of the presence of the CD2 domain by 30 
studying GBD-CD2 and A(CD2) constructions; (ii) the influ- 
ence of the presence of the variable zone by analyzing the 
A(ZV) and CD1-GBD forms; and (iii) the intrinsic catalytic 
potential of the CD1 and CD2 domains expressed in an iso- 
lated manner (CD1 and CD2 constructions). 35 

The catalytic activity of each of the different forms was 
compared with that observed with the control corresponding 
to the entire form deleted from the single signal peptide A(PS) 
[FIG. lc)]. 

5.2 Analysis of Constructions: 40 
At the end of the experimental PCR amplification and 

cloning procedure detailed above, several clones with an 
insertion in the expected orientation were obtained for each of 
the envisaged constructions, with the exception of the trun- 
cated GBD-CD2 form for which the desired amplification 45 
product could not be cloned. 

The sequences for the insertions were determined in order 
to ensure the absence of mutations that after translation may 
modify the amino acids located at positions presumed essen- 
tial for the enzymatic activity of the protein encoded this way. 50 

A mutation was identified at the 3 1" insertion base relative 
to the control A(PS), inducing substitution of one aspartic 
acid by an asparagine in position 1 0 of the variable zone. As 
it is not located in the repeat motifs S of the variable zone 
(FIG. 4), it appears that the incidence of this mutation on the 55 
finally observed function is negligible. 

A mutation was introduced into the amplification product 
corresponding to the construction A(CD2), modifying the 
aromatic residue F 1 411 in leucine. This mutation was located 
in the first third of the glucan binding domain GBD at a 60 
junction between two repeat motifs. 

Because of the errors made by polymerase during PCR 
amplification, the construction A(ZV) did not have the 
expected sequence. In fact, the insertion contained an open 
reading frame, that frame essentially corresponding to the 65 
GBD-CD2 form which could not be cloned. However, in the 
GBD-CD2 form obtained definitively in place of A(ZV), 46 
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N-terminal residues were absent. Now, the GBD domain has 
more than 800 amino acids forming a concatenation of 24 
repeat units. This concatenation is such that, over the 46 
truncated residues, only the last 9 were located at one of said 
units, in particular at the first thereof. It thus appears plausible 
to consider that deletion of these amino acids has no influence 
on the enzymatic reaction catalyzed by the corresponding 
protein form. This hypothesis supported by the fact that in 
other dextransucrases, the loss of a certain number of repeat 
units from the GBD domain does not significantly reduce the 
activity of the resulting protein. 

The insertion encoding the CD1-GBD form contained a 
mutation affecting the F633 residue located in the CD1 
domain and more precisely in the region that is highly con- 
served in dextransucrases, itself located just in front of the 
second aspartic acid of the catalytic triad (FIG. 5). The 
expected phenylalanine was substituted by a leucine. It is 
difficult at this stage to estimate the impact of such a mutation 
on the observed catalytic activity. 

The sequence of insertions encoding the catalytic domains 
CD1 and CD2 was detennined in the same manner as for the 
other constructions. 

5.3 Expression Products and Enzymatic Activities 

The proteins corresponding to the various deleted forms of 
DSR-E were expressed by subjecting the recombinant E. coli 
cells to induction by L-arabinose in a concentration of 
0.002%. The enzymatic activity was observed for the first 
four hours following induction. 

The protein extracts obtained by sonication of the cell 
residues were analyzed by SDS-PAGE electrophoresis (Sam- 
brook and Russel , 2001 , supra). The molecular masses of the 
recombinant proteins were estimated from the electro- 
phoretic profiles obtained, said masses essentially corre- 
sponding to the expected masses taking into account the 12 
kDa incrementation linked to the thioredoxin tag. Table IV 
below summarizes the estimated values for the molecular 
masses of the different truncated forms and, by way of com- 
parison, provides the expected masses. 



TABLE IV 





Expected mass 


Expected mass + 


Estimated mass 


Protein form 


(kDa) 


thioredoxin (kDa) 


(kDa) 


A(PS) 


309 


321 


324 


A(CD2) 


218 


230 


ND 


GBD- CD 2 


224 


/ 


233 


CD! -GBD 


193 


205 


199 


CD1 


99 


111 


111 


CD2 


95 


107 


ND 


ND: not determined 



Table V below indicates the nature and position of amino 
acids marking the start and end of the protein forms con- 
structed in this study. The different positions refer to SEQ ID 
No: 2 corresponding to the protein DSR-E. 



TABLE V 


Protein form 


Starting amino acid Ending amino acid 


Total length 


A(PS) 


N41 


12835 


2795 


A(CD2) 


Ml 


L1980 


1980 


GBD-CD2 


Ml 188 


12835 


1648 


CD1-GBD 


1248 


L1980 


1733 


CD1 


1248 


Q1141 


894 


CD2 


D1981 


12835 


855 



The GBD-CD2 form did not have a thioredoxin tag. In fact, 
this form was derived from experimental uncertainty occa- 
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sioned by the procedure for PCR amplification of the 
sequence assumed to encode the A(ZV) form. Because of the 
deletions from the sequences thus generated, the thioredoxin 
tag, in principle situated at 5' from the protein of interest, 
could not be fiised with the GBD-CD2 region. 

The quality of the electrophoresis gels did not allow deter- 
mination as to whether the level of expression of the different 
forms was quantitatively identical and as a result whether said 
forms were present in the same proportions in the cell 
extracts. 

The activity measurements provided were established on 
the basis of a given volume of cell extracts but could not be 
extrapolated to the quantity of each protein actually contained 
in said volume of extracts. 

The synthesis of dextran polymers in situ by incubating 
electrophoresis gels in a saccharose solution and subsequent 
staining with SchifTs reagent confirmed the presence of pro- 
teins having a glucan-saccharase activity in cell extracts cor- 
responding to A(PS), A(CD2), GBD-CD2 and CD1-GBD. 

Table VI below shows the maximum enzymatic activities 
observed for each construction. The results confirm the data 
drawn from the experiments in which the gels were stained 
with SchifFs reagent, namely the fact that the cell extracts 
relative to the forms A(PS), A(CD2), GBD-CD2 and CD1- 
GBD had a saccharase activity, in contrast to the two catalytic 
domains taken in isolation. This result was in agreement with 
the literature, given that it has been demonstrated that in other 
dextransucrases, the absence of the GBD domain induced a 
drastic loss of enzymatic activity (8, 9, 10). 



TABLE VI 



Protein 
form 


A(PS) A(CD2) 


GBD-CD2 CD1-GBD 


CD1 CD2 


maximum 


1063 " 181 


86 235 


53 0 


activity 








(U/I) 









The intrinsic activity of the CD1 form was too low to be 
detected. Regarding the GBD-CD2 form, it had a non negli- 
gible activity which leads to the conclusion that the corre- 
sponding structural organization, namely a catalytic domain 
downstream of the glucan binding domain, remains enzy- 
matically active. 

5.4 Effect of Deletions on Oligoside Synthesis: 

Provided that the specificity of the synthesis of a(l-»2) 
bonds was conserved during the reaction in the presence of an 
acceptor, experiments for synthesizing oligosides starting 
from maltose were carried out (FIG. 7). 

When the reactions were carried out to completion, i.e. all 
of the saccharose had been consumed, the oligoside synthesis 
yields were calculated. The results are shown in Table VI 
below. Only the reaction involving the cell extract containing 
the protein form CD1 did not allow such a calculation. The 
temperature effect probably resulted in inactivation of the 
very low activity present in the protein extract. 



TABLE VII 





Yield of 


Yield of 






oligosides in 


oligosides in 




Protein form 


OD series (%) 


R. series (%) 


Total oligoside yield 


Native enzyme 


36 


28 


64 


A(PS) 


41 


14 


55 


A(CD2) 


67 




68 


GBD-CD2 


45 


47 


92 


CDl-GBD 


100 


0 


100 



,049 B2 

24 

As indicated in FIG. 7 below, the presence of oligosides 
from series R was only detected with enzymatic forms having 
the catalytic domain CD2, with the exception of the case in 
which said domain was isolated and then rendered com- 

5 pletely inactive. In fact, the retention time for the oligosides 
synthesized by the deleted form of the second catalytic 
domain and by the CDl-GBD form corresponded only to 
those in the OD series, i.e. to GOSs deprived of a(l-*2) 
bonds. These results thus indicate that the CD2 domain was 

10 required for the formation of a(l-*2) bonds. 

The products obtained with the GBD-CD2 form have sup- 
ported these observations. This construction, which had CD2 
as the only catalytic domain, was capable of catalyzing in a 

15 preponderant manner the synthesis of oligosides from the R 
series, having ct(l -»2) bonds. Thus, this results demonstrates 
that specificity in terms of the function of the DSR-E enzyme 
resides in the highly original sequence for this domain, and 
not in the association of two catalytic domains. Further, the 

20 GBD-CD2 protein form also allowed the synthesis of 
a(l->6) bonds. However, the low yields obtained for these 
oligosides indicated that they were preferentially converted 
into oligosides with a higher degree of polymerization 
belonging to the R series, which prevented their accumulation 

25 in the reaction medium, differing from molecules from the R 
series which were not converted (2). 

By comparing the profiles of the products obtained as 
shown in FIG. 7, it is clear that the entire form A(PS) mainly 
synthesizes linear oligosides. In fact, the molecule R4 was 

30 absent and the oligoside R5 only present in a small amount. 
The catalytic domain CD1 catalyzed the exclusive synthesis 
of a( 1 — *6) bonds and its activity appeared to be preponderant 
with respect to that of the CD2 domain. In addition, in the 
entire form of the enzyme, the imp] ication of the CD2 domain 

35 would thus be less important because of: (i) lower mtrinsic 
catalytic parameters; and/or (ii) a global enzyme configura- 
tion that was unfavorable to its activity. 
Further, the entire enzyme A(PS) catalyzed the synthesis of 

40 oligosides from the R series with a lower yield than that 
observed with the mixture of dextransucrases produced by L. 
mesenteroides NRRL B-1299 (FIG. 7). The yield obtained, 
28%, was situated between those observed for the entire form 
A(PS) and for the GBD-CD2 form. It is known that the wild 

45 strain produces several forms of dextransucrases that are sus- 
ceptible of synthesizing osidic bonds, in particular a(lA2) 
bonds. One hypothesis has been proposed, in which said 
forms are the degradation products of DSR-E. Insofar as the 
truncated forms of DSR-E such as GBD-CD2 could catalyze 

50 the synthesis of oligosides from the R series more effectively, 
it would appear that the yields obtained with the heteroge- 
neous mixture produced by L. mesenteroides NRRL B-l 299 
can be attributed to the contribution of the catalytic activities 
of the ensemble of said different enzymatic forms. 

55 In conclusion, by isolating a particular dextransucrase pro- 
duced by L. mesenteroides , the inventors have succeeded in 
characterizing a particular and unexpected structure of this 
enzyme that can produce oligosides of interest and have 
a(l ->2) type linkages. Identification and characterization of 

go this sequence allows the construction of recombinant cells or 
organisms specifically expressing this enzyme and also 
allows its modification by directed or random mutagenesis or 
by DNA shuffling to further improve its characteristics to be 
envisaged. 

65 This invention can also improve the yield and reproduc- 
ibility of the production of GOSs of interest for the different 
applications cited above. 
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SEQUENCE LISTING 



<160> NUMBER OF SEQ ID NOS : 103 

<210> SEQ ID NO I 
<211> LENGTH: 855 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<220> FEATURE: 

<223> OTHER INFORMATION: catalytic domain 
<400> SEQUENCE: 1 

Asp Met Ser Thr Aon Ala Phe Ser Thr Lys Asn Val Ala Phe Asn His 
15 10 15 

Asp Ser Ser Ser Phe Asp His Thr Val Asp Gly Phe Leu Thr Ala Asp 
20 25 30 

Thr Trp Tyr Arg Pro Lys Ser lie Leu Ala Asn Gly Thr Thr Trp Arg 
35 40 45 

Asp Ser Thr Asp Lys Asp Met Arg Pro Leu He Thr Val Trp Trp Pro 
50 55 60 

Asn Lys Asn Val Gin Val Asn Tyr Leu Asn Phe Met Lys Ala Asn Gly 
65 70 75 80 

Leu Leu Thr Thr Ala Ala Gin Tyr Thr Leu His Ser Asp Gin Tyr Asp 
85 90 95 

Leu Asn Gin Ala Ala Gin Asp Val Gin Val Ala He Glu Arg Arg He 
100 105 110 

Ala Ser Glu His Gly Thr Asp Trp Leu Gin Lys Leu Leu Phe Glu Ser 
115 120 125 

Gin Asn Asn Asn Pro Ser Phe Val Lys Gin Gin Phe He Trp Asn Lys 
130 135 140 

Asp Ser Glu Tyr His Gly Gly Gly Asp Ala Trp Phe Gin Gly Gly Tyr 
145 150 155 160 

Leu Lys Tyr Gly Asn Asn Pro Leu Thr Pro Thr Thr Asn Ser Asp Tyr 
165 170 175 

Arg Gin Pro Gly Asn Ala Phe Asp Phe Leu Leu Ala Asn Asp Val Asp 
180 185 190 



Asn Ser Asn Pro Val Val Gin Ala Glu Asn Leu Asn Trp Leu His Tyr 
195 200 205 
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-continued 



Leu Met Asn Phe Gly Thr He Thr Ala Gly Gin Asp Asp Ala Asn Phe 
210 215 220 

Asp Ser He Arg He Asp Ala Val Asp Phe He His Asn Asp Thr He 
225 230 235 240 

Gin Arg Thr Tyr Asp Tyr Leu Arg Asp Ala Tyr Gin Val Gin Gin Ser 
245 250 255 

Glu Ala Lys Ala Asn Gin His He Ser Leu Val Glu Ala Gly Leu Asp 
260 265 270 

Ala Gly Thr ser Thr He His Asn Asp Ala Leu He Glu Ser Asn Leu 

275 280 285 

Arg Glu Ala Ala Thr Leu Ser Leu Thr Asn Glu Pro Gly Lye Asn Lys 
290 295 300 

Pro Leu Thr Aen Met Leu Gin Asp Val Asp Gly Gly Thr Leu He Thr 
305 310 315 320 

Asp His Thr Gin Aen Ser Thr Glu Asn Gin Ala Thr Pro Asn Tyr Ser 
325 330 335 

He He Hlo Ala His Asp Lys Gly Val Gin Glu Lys Val Gly Ala Ala 
340 345 350 

He Thr Asp Ala Thr Gly Ala Asp Trp Thr Asn Phe Thr Asp Glu Gin 
355 360 365 

Leu Lys Ala Gly Leu Glu Leu Phe Tyr Lys Asp Gin Arg Ala Thr Asn 
370 375 380 

Lys Lys Tyr Asn Ser Tyr Asn He Pro Ser He Tyr Ala Leu Met Leu 
385 390 395 400 

Thr Asn Lys Asp Thr Val Pro Arg Met Tyr Tyr Gly Asp Met Tyr Gin 
405 410 415 

Asp Asp Gly Gin Tyr Met Ala Asn Lys Ser He Tyr Tyr Asp Ala Leu 
420 425 430 

Val Ser Leu Met Thr Ala Arg Lys Ser Tyr Val Ser Gly Gly Gin Thr 
435 440 445 

Met Ser Val Asp Asn His Gly Leu Leu Lys Ser Val Arg Phe Gly Lys 
450 455 460 

Asp Ala Met Thr Ala Asn Asp Leu Gly Thr Ser Ala Thr Arg Thr Glu 

465 470 475 480 

Gly Leu Gly Val He He Gly Asn Asp Pro Lys Leu Gin Leu Asn Asp 
485 490 495 

Ser Asp Lys Val Thr Leu Asp Met Gly Ala Ala His Lys Asn Gin Lys 
500 505 510 

Tyr Arg Ala Val He Leu Thr Thr Arg Asp Gly Leu Ala Thr Phe Asn 
515 520 525 

Ser Asp Gin Ala Pro Thr Ala Trp Thr Asn Asp Gin Gly Thr Leu Thr 
530 535 540 

Phe Ser Asn Gin Glu He Asn Gly Gin Asp Asn Thr Gin He Arg Gly 
545 550 555 560 

Val Ala Asn Pro Gin Val Ser Gly Tyr Leu Ala Val Trp Val Pro Val 
565 570 575 

Gly Ala Ser Asp Asn Gin Asp Ala Arg Thr Ala Ala Thr Thr Thr Glu 
580 585 590 

Asn His Asp Gly Lys Val Leu His Ser Asn Ala Ala Leu Asp Ser Asn 
595 600 605 

Leu He Tyr Glu Gly Phe Ser Asn Phe Gin Pro Lys Ala Thr Thr His 
610 615 620 
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Asp Glu Leu Thr Asn Val Val He Ala Lys Aon Ala Asp Val Phe Asn 
625 630 635 640 

Asn Trp Gly He Thr Ser Phe Glu Met Ala Pro Gin Tyr Arg Ser Ser 
645 650 655 

Gly Asp His Thr Phe Leu Asp Ser Thr He Asp Asn Gly Tyr Ala Phe 
660 665 670 

Thr Asp Arg Tyr Asp Leu Gly Phe Asn Thr Pro Thr Lys Tyr Gly Thr 
675 680 685 

Asp Gly Asp Leu Arg Ala Thr He Gin Ala Leu His His Ala Asn Met 
690 695 700 

Gin Val Met Ala Asp Val Val Asp Asn Gin Val Tyr Asn Leu Pro Gly 
70S 710 715 720 

Lys Glu Val Val Ser Ala Thr Arg Ala Gly Val Tyr Gly Asn Asp Asp 
725 730 735 

Ala Thr Gly Phe Gly Thr Gin Leu Tyr Val Thr Asn Ser Val Gly Gly 
740 745 750 

Gly Gin Tyr Gin Glu Lys Tyr Ala Gly Gin Tyr Leu Glu Ala Leu Lys 
755 760 765 

Ala Lys Tyr Pro Asp Leu Phe Glu Gly Lys Ala Tyr Asp Tyr Trp Tyr 
770 775 780 

Lys Asn Tyr Ala Asn Asp Gly Ser Asn Pro Tyr Tyr Thr Leu Ser His 
785 790 795 800 

Gly Asp Arg Glu Ser He Pro Ala Asp Val Ala He Lys Gin Trp Ser 
805 810 815 

Ala Lys Tyr Met Asn Gly Thr Asn Val Leu Gly Asn Gly Met Gly Tyr 
820 825 830 

Val Leu Lys Asp Trp His Asn Gly Gin Tyr Phe Lys Leu Asp Gly Asp 
83 5 840 845 

Lys Ser Thr Leu Pro Gin He 
850 855 

<210> SEQ ID NO 2 
<211> LENGTH: 2835 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc me sent ero ides 
<220> FEATURE: 

<223> OTHER INFORMATION : Complete protein DSR-E 
<400> SEQUENCE: 2 

Met Arg Asp Met Arg Val He Cys Asp Arg Lys Lys Leu Tyr Lys Ser 
1 ' 5 10 15 

Gly Lys Val Leu Val Thr Ala Gly He Phe Ala Leu Met Met Phe Gly 
20 25 30 

Val Thr Thr Ala Ser Val Ser Ala Asn Thr He Ala Val Aep Thr Asn 
35 40 45 

His Ser Arg Thr Ser Ala Gin He Asn Lys Ser Ala Val Asp Lys Val 
50 55 60 

Asn Asp Asp Lys Thr Thr Leu Gly Ala Ala Lys Val Val Ala Val Ala 
65 70 75 80 

Thr Thr Pro Ala Thr Pro Val Ala Asp Lys Thr Val Ser Ala Pro Ala 
85 90 95 

Ala Asp Lys Ala Val Asp Thr Thr Ser Ser Thr Thr Pro Ala Thr Asp 
100 105 110 

Lys Ala Val Asp Thr Thr Pro Thr Thr Pro Ala Ala Asp Lys Ala Val 
115 120 125 
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Asp Thr Thr Pro Thr Thr Pro Ala Ala Asp Lys Ala Val Asp Thr Thr 
130 135 140 

Pro Thr Thr Pro Ala Ala Asn Lys Ala Val Asp Thr Thr Pro Ala Thr 
145 ISO 155 160 

Ala Ala Thr Asp Lys Ala Val Ala Thr Pro Ala Thr Pro Ala Ala Asp 
165 170 175 

Lys Leu Ala Asn Thr Thr Pro Ala Thr Asp Lye Ala val Ala Thr Thr 
180 185 190 

Pro Ala Thr Pro Val Ala Asn Lys Ala Ala Asp Thr Ser Ser lie His 
195 200 205 

Asp Gin Pro Leu Asp Thr Asn Val Pro Thr Asp Lys Ser Ala Asn Leu 
210 215 220 

Val Ser Thr Thr Gin Lys Ser Thr Asp Asn Gin Gin Val Lys Ser Thr 
225 230 235 240 

Glu Thr Ser His Leu Gin Glu lie Asn Gly Lys Thr Tyr Phe Leu Asp 
245 250 255 

Asp Asn Gly Gin Val Lys Lys Asn Phe Thr Ala He He Asp Gly Lys 
260 265 270 

Val Leu Tyr Phe Asp Lys Thr Ser Gly Glu Leu Thr Ala Asn Ala Pro 
275 280 285 

Gin Val Thr Lys Gly Leu Val Asn He Asp Asn Ala His Asn Ala Ala 
290 295 300 

His Asp Leu Thr Ala Asp Asn Phe Thr Asn Val Asp Gly Tyr Leu Thr 
305 310 315 320 

Ala Asn Ser Trp Tyr Arg Pro Lys Asp He Leu Lys Asn Gly Thr Thr 
325 330 335 

Trp Thr Pro Thr Thr Ala Glu Asp Phe Arg Pro Leu Leu Met Ser Trp 
340 345 350 

Trp Pro Asp Lys Asn Thr Gin Val Ala Tyr Leu Gin Tyr Met Gin Ser 
355 360 365 

Val Gly Met Leu Pro Asp Asp Val Lys Val Ser Asn Asp Asp Asn Met 
370 375 380 

Ser Thr Leu Thr Asp Ala Ala Met Thr Val Gin Lys Asn He Glu Ser 
385 390 395 400 

Arg He Gly Val Ser Gly Lys Thr Asp Trp Leu Lys Gin Asp Met Asn 
405 410 415 

Lya Leu He Asp Ser Gin Ala Asn Trp Asn He Asp Ser Glu Ser Lys 
420 425 430 

Gly Asn Asp His Leu Gin Gly Gly Ala Leu Leu Tyr Val Asn Asp Asp 
435 440 445 

Lys Thr Pro Asn Ala Asn Ser Asp Tyr Arg Leu Leu Asn Arg Thr Pro 
450 455 460 

Thr Asn Gin Thr Gly Gin He Thr Asp Pro Ser Lys Gin Gly Gly Tyr 
465 470 475 480 

Glu Met Leu Leu Ala Asn Asp Val Asp Asn Ser Asn Pro Val Val Gin 

485 490 495 

Ala Glu Gin Leu Asn Trp Leu His Tyr Met Met Asn He Gly Thr He 
500 505 510 

Ala Gin Asn Asp Pro Thr Ala Asn Phe Asp Gly Tyr Arg Val Asp Ala 
515 520 525 

Val Asp Asn Val Asp Ala Asp Leu Leu Gin He Ala Gly Asp Tyr Phe 
530 535 540 



Lyo Ala Ala Tyr Gly Thr Gly Lyo Thr Glu Ala Aon Ala Aon Aon His 
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He Ser He Leu Glu Asp Trp Asp Asn Asn Asp Ser Ala Tyr He Lys 
565 570 575 

Ala His Gly Asn Asn Gin Leu Thr Met Asp Phe Pro Ala His Leu Ala 
580 585 590 

Leu Lys Tyr Ala Leu Asn Met Pro Leu Ala Ala Gin Ser Gly Leu Glu 
595 600 605 

Pro Leu He Asn Thr Ser Leu Val Lys Arg Gly Lys Asp Ala Thr Glu 
610 615 620 

Asn Glu Ala Gin Pro Asn Tyr Ala Phe He Arg Ala His Asp Ser Glu 
625 630 635 640 

Val Gin Thr Val He Ala Gin He lie Lys Asp Lys lie Asn Thr Lys 
645 650 655 

Ser Asp Gly Leu Thr Val Thr Pro Asp Glu He Lys Gin Ala Phe Thr 
660 665 670 

lie Tyr Asn Ala Asp Glu Leu Lys Ala Asp Lys Glu Tyr Thr Ala Tyr 
675 680 685 

Asn He Pro Ala Ser Tyr Ala Val Leu Leu Thr Asn Ly3 Asp Thr Val 
690 695 700 

Pro Arg Val Tyr Tyr Gly Asp Leu Phe Ser Asp Asp Gly Gin Tyr Met 
705 710 715 720 

Ser Gin Lys Ser Pro Tyr Tyr Asp Ala He Thr Ser Leu Leu Lys Ser 
725 730 735 

Arg He Lys Tyr Val Ala Gly Gly Gin Ser Met Asn Met Thr Tyr Leu 
740 745 750 

His Glu Cys Phe Asp Pro Ala Lys Asn Glu Thr Lys Pro Gin Gly Val 
755 760 765 

Leu Thr Ser Val Arg Tyr Gly Lys Gly Ala Met Thr Ala Asp Asp Leu 
770 775 780 

Gly Asn Ser Asp Thr Arg Gin Gin Gly He Gly Leu Val lie Asn Asn 
785 790 795 800 

Lys Pro Phe Leu Asn Leu Asn Asp Asp Glu Gin lie Val Leu Asn Met 
805 810 815 

Gly Ala Ala His Lys Asn Gin Ala Tyr Arg Pro Leu Met Leu Thr Thr 
820 825 830 

Lys ser Gly Leu Gin lie Tyr Asp Lys Asp Ala Gly Ala Pro Val Val 
835 840 845 

Tyr Thr Asn Asp Ala Gly Gin Leu He Phe Lys Ser Asp Met Val Tyr 
850 855 860 

Gly Val Ser Asn Pro Gin Val Ser Gly Tyr Phe Ala Ala Trp Val Pro 
865 870 875 880 

Val Gly Ala ser Asp Ser Gin Asp Ala Arg Thr Gin Ser Ser Gin Ser 
885 890 895 

Glu Thr Lys Asp Gly Asp Val Tyr His Ser Asn Ala Ala Leu Asp Ser 
900 905 910 

Asn Val He Tyr Glu Gly Phe Ser Asn Phe Gin Ala Met Pro Glu Lys 
915 920 925 

Asn Asp Asp Phe Thr Asn Val Lys lie Ala Gin Asn Ala Lys Leu Phe 
930 935 940 

Lys Asp Leu Gly He Thr Ser Phe Glu Leu Ala Pro Gin Tyr Arg Ser 

945 950 955 960 

Ser Thr Asp Asn Ser Phe Leu Asp Ser Val He Gin Asn Gly Tyr Ala 
965 * 970 975 
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Phe Thr Asp Arg Tyr Asp Val Gly Tyr Asn Thr Pro Thr Lya Tyr Gly 
980 985 990 

Thr Val Asp Gin Leu Leu Asp Ser Leu Arg Ala Leu His Ala Gin Gly 
995 1000 1005 

lie Gin Ala lie Asn Asp Trp Val Pro Asp Gin lie Tyr Asn Leu Pro 
1010 1015 1020 

Gly Glu Gin lie Val Thr Ala Val Arg Thr Asn Gly Ser Gly Lys Tyr 
1025 1030 1035 1040 

Asp Tyr Asp Ser Val lie Asn Asn Thr Leu Tyr Asp Ser Arg Thr Val 
1045 1050 1055 

Gly Gly Gly Glu Tyr Gin Glu Lys Phe Gly Gly Leu Phe Leu Asp Gin 
1060 1065 1070 

Leu Lys Lys Asp Tyr Pro Ser Leu Phe Glu Thr Lys Gin lie Ser Thr 
1075 1080 1085 

Asn Gin Pro Met Asn Pro Asp Val Lys lie Lys Glu Trp Ser Ala Lys 
1090 1095 1100 

Tyr Phe Asn Gly Ser Asn lie Gin Gly Arg Gly Ala Trp Tyr Val Leu 
1105 1110 1115 1120 

Lys Asp Trp Ala Thr Asn Gin Tyr Phe Asn Val Ser Ser Asp Asn Gly 
1125 1130 1135 

Phe Leu Pro Lys Gin Leu Leu Gly Glu Lys Thr Ser Thr Gly Phe lie 
1140 1145 1150 

Thr Glu Asn Gly Lys Thr Ser Phe Tyr Ser Thr Ser Gly Tyr Gin Ala 
1155 1160 1165 

Lys Asp Thr Phe He Gin Asp Gly Thr Asn Trp Tyr Tyr Phe Asp Asn 
1170 1175 1180 

Ala Gly Tyr Met Leu Thr Gly Lys Gin Asn He His Asp Lys Asn Tyr 
1185 ' 1190 1195 1200 

Tyr Phe Leu Pro Asn Gly Val Glu Leu Gin Asp Ala Tyr Leu Phe Asp 
1205 1210 1215 

Gly Asn Gin Glu Phe Tyr Tyr Asn Lys Ala Gly Glu Gin Val Met Asn 
1220 1225 1230 

Gin Tyr Tyr Gin Asp Ser Gin Asn Gin Trp His Tyr Phe Phe Glu Asn 
1235 1240 1245 

Gly Arg Met Ala He Gly Leu Thr Glu Val Pro Asn Ala Asp Gly Thr 
1250 1255 1260 

His Val Thr Gin Tyr Phe Asp Ala Asn Gly Val Gin He Lys Gly Thr 
1265 1270 1275 1280 

Ala He Lys Asp Gin Asn Asn Gin Leu Arg Tyr Phe Asp Glu Ala Thr 
1285 1290 1295 

Gly Asn Met Val Val Asn Ser Trp Gly Gin Leu Ala Asp Lys Ser Trp 
1300 1305 1310 

Leu Tyr Leu Asn Ala Gin Gly Val Ala Val Thr Gly Asn Gin Lys He 
1315 1320 1325 

Asp Gly Glu Glu Tyr Tyr Phe Asn Ala Asp Gly Lys Gin Val Lys Gly 
1330 1335 1340 

Asn Ala He He Asp Asn Asn Gly Asp Gin Arg Tyr Tyr Asp Gly Asp 
1345 1350 13S5 1360 

Lys Gly Val Met Val Val Asn Ser Trp Gly Glu Leu Pro A3p Gly Ser 
136S 1370 1375 



Trp Leu Tyr Leu Asn Asp Lys Gly He Ala Val Thr Gly Arg Gin Val 
1380 1385 1390 



Case 1 :09-cv-00630-CKK Document 1-2 Filed 04/06/2009 Page 26 of 53 



US 7,439,049 B2 
37 38 

-continued 

He Asn Asn Gin Val Asn Phe Phe Gly Asn Asp Gly Lye Gin He Lys 
1395 1400 1405 

Asp Ala Phe Lys Leu Leu Ser Asp Gly Ser Trp Val Tyr Leu Asp Asp 
1410 1415 1420 

Lys Gly Leu He Thr Thr Gly Ala Lys Val He Asn Gly Leu Asn Met 
1425 * 1430 1435 1440 

Phe Phe Asp Lys Asp Gly His Gin He Lys Gly Asp Ala Ser Thr Asp 
1445 1450 1455 

Ala Asn Gly Lys Arg His Tyr Tyr Asp Lys Aon Asp Gly His Leu Val 
1460 1465 1470 

Thr Asn Ser Trp Gly Glu Leu Pro Asp Gly Ser Trp Leu Tyr Leu Glu 
1475 1480 1485 

Glu Gin Gly Asp Ala Val Thr Gly Gin Arg Val He Asp Gly Lys Thr 
1490 1495 1500 

Arg Tyr Phe Asp Glu Asp Gly Lys Gin He Lys Asn Ser Leu Lys Thr 
1505 1510 1515 1520 

Leu Ala Asn Gly Asp Lys He Tyr Leu Asp Gly Asp Gly Val Ala Ala 
1525 1530 1535 

Thr Gly Leu Gin His Val Gly Asp Lys He Met Tyr Phe Asp Glu Asp 
1540 1545 1550 ■ 

Gly Lys Gin Val Val Gly Lys Phe Val Ser Ala Lys Asp Gly Ser Trp 
1555 ,1560 1565 

Tyr Tyr Leu Asn Gin Asp Gly Val Ala Ala Val Gly Pro Ser Ser He 
1570 1575 1580 

Asn Gly Gin Ser Leu Tyr Phe Asp Gin Asp Gly Lys Gin Val Lys Tyr 
1585 1590 1595 1600 

Asn Glu Val Arg Asn Ser Asp Gly Thr Thr Asn Tyr Tyr Thr Gly Leu 
1605 1610 1615 

Thr Gly Glu Lys Leu Thr Gin Asp Phe Gly Glu Leu Pro Asp Gly Ser 
1620 1625 1630 

Trp He Tyr Leu Asp Ala Gin Gly His Thr Val Thr Gly Ala Gin He 

1635 1640 1645 

He Asn Gly Gin Asn Leu Tyr Phe Lys Ala Asp Gly Gin Gin Val Lys 
1650 1655 1660 

Gly His Ala Tyr Thr Asp Gin Leu Gly His Met Arg Phe Tyr Asp Pro 
1665 * 1670 1675 1680 

Asp Ser Gly Asp Met Leu Ser Asn Arg Phe Glu Gin He Thr Pro Gly 
1685 1690 1695 

Val Trp Ala Tyr Phe Gly Ala Asp Gly Val Ala He Thr Gly Gin His 
1700 1705 1710 

Asp He Asn Gly Gin Lys Leu Phe Phe Asp Glu Thr Gly Tyr Gin Val 
1715 1720 1725 

Lys Gly Ser Gin Arg Thr lie Asp Gly Thr Leu Tyr Ser Phe Asp Ser 
1730 1735 1740 

Gin Thr Gly Asn Gin Lys Arg Val Gin Thr Thr Leu Leu Pro Gin Ala 
1745 1750 1755 1760 

Gly His Tyr He Thr Lys Asn Gly Asn Asp Trp Gin Tyr Asp Thr ABn 
1765 1770 1775 

Gly Glu Leu Ala Lys Gly Leu Arg Gin Asp Ser Asn Gly Lys Leu Arg 
1780 1785 1790 

Tyr Phe Asp Leu Thr Thr Gly He Gin Ala Lys Gly Gin Phe Val Thr 
1795 1800 1805 

He Gly Gin Glu Thr Tyr Tyr Phe Ser Lys Asp His Gly Asp Ala Gin 
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Leu Leu Pro Met Val Thr Glu Gly His Tyr Gly Thr lie Thr Leu Lys 
1825 1830 1835 1840 

Gin Gly Gin Asp Thr Lys Thr Ala Trp Val Tyr Arg Asp Gin Asn Asn 
1845 1850 1855 

Thr lie Leu Lys Gly Leu Gin Asn lie Asn Gly Thr Leu Gin Phe Phe 
1860 1865 1870 

Asp Pro Tyr Thr Gly Glu Gin Leu Lys Gly Gly Val Ala Lys Tyr Asp 
1875 1880 1885 

Asp Lys Leu Phe Tyr Phe Glu Ser Gly Lys Gly Asn Leu Val Ser Thr 
1890 1895 1900 

Val Ala Gly Asp Tyr Gin Asp Gly His Tyr lie Ser Gin Asp Gly Gin 
1905 1910 1915 1920 

Thr Arg Tyr Ala Asp Lys Gin Asn Gin Leu Val Lys Gly Leu Val Thr 
1925 1930 1935 

Val Asn Gly Ala Leu Gin Tyr Phe Asp Asn Ala Thr Gly Asn Gin lie 
1940 1945 1950 

Lys Asn Gin Gin Val He Val Asp Gly Lys Thr Tyr Tyr Phe Asp Asp 
1955 1960 1965 

Lys Gly Asn Gly Glu Tyr Leu Phe Thr Asn Thr Leu Asp Met Ser Thr 
1970 1975 1980 

Asn Ala Phe Ser Thr Lys Asn Val Ala Phe Asn Hia Asp Ser Ser Ser 
1985 1990 1995 2000 

Phe Asp His Thr Val Asp Gly Phe Leu Thr Ala Asp Thr Trp Tyr Arg 
2005 2010 2015 

Pro Lys Ser He Leu Ala Asn Gly Thr Thr Trp Arg Asp Ser Thr Asp 
2020 2025 2030 

Lys Asp Met Arg Pro Leu He Thr Val Trp Trp Pro Asn Lys Asn Val 
2035 2040 2045 

Gin Val Asn Tyr Leu Asn Phe Met Lys Ala Asn Gly Leu Leu Thr Thr 
2050 2055 2060 

Ala Ala Gin Tyr Thr Leu His Ser Asp Gin Tyr Asp Leu Asn Gin Ala 
2065 2070 2075 2080 

Ala Gin Asp Val Gin Val Ala He Glu Arg Arg He Ala Ser Glu His 
2085 2090 2095 

Gly Thr Asp Trp Leu Gin Lys Leu Leu Phe Glu Ser Gin Asn Asn Asn 
2100 2105 2110 

Pro Ser Phe Val Lys Gin Gin Phe He Trp Asn Lys Asp Ser Glu Tyr 
2115 2120 2125 

His Gly Gly Gly Asp Ala Trp Phe Gin Gly Gly Tyr Leu Lys Tyr Gly 
2130 2135 2140 

Asn Asn Pro Leu Thr Pro Thr Thr Asn Ser Asp Tyr Arg Gin Pro Gly 
2145 2150 2155 2160 

Asn Ala Phe Asp Phe Leu Leu Ala Asn Asp Val Asp Asn Ser Asn Pro 
2165 2170 2175 

Val Val Gin Ala Glu Asn Leu Asn Trp Leu His Tyr Leu Met Asn Phe 
2180 2185 2190 

Gly Thr He Thr Ala Gly Gin Asp Asp Ala Asn Phe Asp Ser He Arg 
2195 2200 2205 

He Asp Ala Val Asp Phe He His Asn Asp Thr He Gin Arg Thr Tyr 
2210 2215 2220 

Asp Tyr Leu Arg Asp Ala Tyr Gin Val Gin Gin Ser Glu Ala Lys Ala 
2225 2230 2235 2240 
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Asp Leu Gly Phe Asn Thr Pro Thr Lys Tyx Gly Thr Asp Gly Asp Leu 
2660 2665 2670 

Arg Ala Thr He Gin Ala Leu His His Ala Asn Met Gin Val Met Ala 
2675 2680 2685 

Asp Val Val Asp Asn Gin Val Tyr Asn Leu Pro Gly Lys Glu Val Val 
2690 2695 2700 

Ser Ala Thr Arg Ala Gly Val Tyr Gly Asn Asp Asp Ala Thr Gly Phe 
2705 2710 2715 2720 

Gly Thr Gin Leu Tyr Val Thr Asn Ser Val Gly Gly Gly Gin Tyr Gin 
2725 2730 2735 

Glu Lys Tyr Ala Gly Gin Tyr Leu Glu Ala Leu Lys Ala Lys Tyr Pro 
2740 2745 2750 

Asp Leu Phe Glu Gly Lys Ala Tyr Asp Tyr Trp Tyr Lys Asn Tyr Ala 
2755 2760 2765 

Asn Asp Gly Ser Asn Pro Tyr Tyr Thr Leu Ser His Gly Asp Arg Glu 
2770 2775 2780 

Ser He Pro Ala Asp Val Ala He Lys Gin Trp Ser Ala Lys Tyr Met 
2785 2790 2795 2800 

Asn Gly Thr Asn Val Leu Gly Asn Gly Met Gly Tyr Val Leu Lys Asp 
2805 2810 2815 

Trp His Asn Gly Gin Tyr Phe Lys Leu Asp Gly Asp Lys Ser Thr Leu 
2820 2825 2830 

Pro Gin He 
2835 



<210> SEQ ID NO 3 
<211> LENGTH: 2568 
<212> TYPE: DNA 

<213> ORGANISM: Leuconostoc mesenteroides 
<220> FEATURE: 

<223> OTHER INFORMATION : catalytic domain 
<400> SEQUENCE: 3 



gatatgtcta 


ctaatgcttt 


ttctaccaaa 


aatgttgcat 


tcaatcatga 


cagtagcagt 


60 


ttcgaccata 


ctgttgatgg 


cttcttgacg 


gcagatactt 


ggtatcgacc 


aaagtcaatt 


12 0 


ttggctaacg 


ggacaacttg 


gcgtgattcg 


actgataagg 


atatgcgacc 


attaatcact 


180 


gtttggtggc 


caaataagaa 


tgttcaagtc 


aactacctca 


acttcatgaa 


agcaaatggc 


240 


ttgttgacaa 


cagcagcaca 


acacacacta 


cattcagatc 


aatatgatct 


gaaccaagct 


300 


gcacaagatg 


ttcaagtggc 


cattgaaagg 


cgcattgcgt 


cagagcatgg 


cacagactgg 


360 


ttacagaaat 


tgttgtttga 


atcacaaaat 


aataacccat 


catttgtgaa 


gcaacaattc 


420 


atttggaaca 


aggattctga 


atatcatggt 


ggtggtgatg 


cttggttcca 


aggtggttat 


480 


ctgaagtatg 


gcaataaccc 


actcacacca 


acaactaatt 


ctgattatcg 


tcaacctggt 


540 


aatgcatttg 


atttcttgct 


agccaacgac 


gtggataatt 


ctaatcctgc 


tgtgcaagct 


600 


gaaaacttaa 


actggttaca 


ttacttaatg 


aactttggca 


ccatcactgc 


gggtcaagat 


660 


gacgctaatt 


ttgatagcat 


tcgtattgac 


gctgtcgact 


ttattcataa 


tgatacaatc 


720 


caacgtactt 


atgattatct 


tcgtgatgct 


tatcaagtgc 


aacaaagtga 


agccaaagca 


780 


aaccagcaca 


tttcattggt 


tgaagctggc 


ttagacgcag 


gtacatcaac 


gattcataat 


84 0 


gatgcgttaa 


ttgagtcaaa 


cctccgtgaa 


gcagcgacat 


tgtcgttaac 


aaatgaacct 


900 


ggtaaaaata 


aaccattgac 


gaacatgcta 


caagacgttg 


acggcggt ac 


gcttatcacc 


960 


gaccatacgc 


agaatagtac 


agaaaatcag 


gcgacaccaa 


actattcaat 


tattcacgcg 


1020 
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cacgataaag 


gtgtgcaaga 


aaaagtaggt 


gcagccatta 


ctgatgctac tggtgctgat 


1080 


tggacgaact 


ttacagatga 


acagtcaaaa 


gccggattag 


agctattcta taaggatcag 


X140 


cgcgcaacaa 


acaaaaagta 


taatagttat 


aacataccaa 


gtatttatgc cctgatgttg 


1200 


acaaacaaag 


atactgttcc 


tcgtatgtat 


tatggggata 


tgtatcaaga tgacggacag 


1260 


tatacggcaa 


acaagagtat 


ctactatgat 


gccttagtgt 


cattaatgac ggctcgtaaa 


1320 


agctatgtca 


qcqqtqqtca 


aactatgagt 


gttgacaatc 


atggtttgtt gaagagtgtc 


1380 


cgttttggaa 


aagatgcgat 


gacagctaat 


gatttaggta 


catcagctac gcgtactgag 


1440 


33 u *»33 '"IJ 


tcattattgg 


taatgatcca 


aagttgcaac 


ttaatgattc ggataaagtg 


1500 


acac t gga t a 


tqqqcqcaqc 


acataaaaat 


caaaagtatc 


gcgcagttat cttaacaaca 


1560 


cqtqatqqtt 


tggca&cctt 


taattcagat 


caagcaccaa 


cagcttggac aaacgatcaa 


1620 


ggaacgtt aa 


cattctcaaa 


tcaagagatt 


aacgggcaag 


acaatacaca aattcgtggt 


1660 


gctgctaatc 


cgcaagtttc 


tggttatcta 


gctgtttggg 


tgcctgtggg tgcatcagac 


1740 


aatcaagatg 


cccgtacagc 


agcaacgaca 


acagaaaatc 


atgatggtaa agtattacac 


1900 


tcgaatgcgg 


cattagattc 


taaccttatt 


tatgaaggtt 


tccctaactt ccaacctaag 


I860 


gcaacaacgc 


atgatgaact 


tacgaacgtt 


gtaattgcta 


aaaatgccga tgtcttcaat 


1920 


aattqqqqta 


ttacgagttt 


tgaaatggca 


ccacagtacc 


gttcaagtgg ggaccataca 


1980 


ttcttggatt 


caacgattga 


t aa tggttat 


gcctt cactg 


atcgctatga cttaggtttc 




aatacaccaa 


caaagtatgg 


cactgatggt 


gatttgcgtg 


caacgattca agcgctacat 


2100 


catgctaata 


tgcaagttat 


ggctgacgtt 


gttgataacc 


aggtctataa cttacctggt 


2160 


aaagaagttg 


tttcagcaac 


acgagcaggt 


gtttatggta 


atgacgacgc cacgggcttt 


2220 


ggaacgcaac 


tctatgtgac 


taactccgtt 


ggtggtggcc 


aataccaaga gaaatatgct 


2280 


ggacaatact 


tagaagctct 


gaaagcaaag 


tatccagacc 


tctttgaggg taaggcctat 


2340 


gattattggc 


ataagaacta 


tgcaaatgat 


gggtcaaatc 


cttactatac attgtcacac 


2400 


ggtgaccgtg 


aatctatccc 


agcagatgtt 


gctattaagc 


aatggtcagc taagtatatg 


2460 


aacggcacga 


acgttttggg 


caatggtatg 


ggttatgtat 


tgaaggattg gcataatggt 


2520 


caatatttca 


agcttgatgg 


tgataaatca 


acattacctc 


aaatctaa 


2568 



<210> SEQ ID NO 4 
<211> LENGTH: 8506 
<212> TYPE: DNA 

<213> ORGANISM: Leuconoetoc mesenteroideg 
<220> FEATURE : 

<223> OTHER INFORMATION: Coding sequence DSR-E 
<400> SEQUENCE: 4 

atgagagaca tgagggtaat ttgtgaccgt aaaaaattgt acaaatcggg caaagtacta 60 
gtaacagccg gtatctttgc cttgatgatg tttggcgtca caactgctag tgttagtgca 120 
aatacgattg cagttgacac gaatcatagc cgtacttcag cacagattaa taagagtgcc 180 
gttgataagg ttaatgatga caagactact ttaggagcgg caaaagtagt ggcagtagcc 240 
acaacgccag cgacaccggt agcagataaa acagtaagtg cacccgcagc agataaggca 300 
gtagatacaa cgtcatcaac gacacctgca acggataagg cagtagatac aacgccaacg 360 
acacctgcag cagataaggc agtagataca acgccaacga cacctgcagc agataaggca 420 
gtagatacaa cgccaacgac acctgcagca aataaagcag tagatacaac gccagcgacc 480 
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gctgcaacag 


ataaggcggt 


agccacgcca gccacacccg cagcagataa 


get agcaaat 


540 


acgacgcctg 


caacggacaa 


ggcagtagcc acaacgccag cgacgccggt 


agcaaataaa 


600 


gcagcagaca 


cgagtagtat 


tcatgatcaa ccattagata caaatgtgcc 


aactgataaa 


660 


tcagcaaacc 


tcgtctcgac 


aacacaaaaa agtacggata atcaacaagt 


taagtctaca 


720 


gaaacatctc 


atcttcaaga 


aatcaacggt aaaacctatc ttcttgacga 


caatggtcaa 


780 


gttaaaaaga 


acttcaccgc 


tattattgac ggtaaagttc tatactttga 


taaaacatcc 


840 


ggcgaattga 


ccgcaaatgc 


accgcaagtt actaagggat tagtaaatat 


tgataatgea 


900 


cataacgcgg 


ctcatgatct 


cacagctgat aacttcacaa atgtcgatgg 


ttacttaaca 


960 


gctaacagtt 


ggcatcgtcc 


taaggacacc ttaaaaaacg gaacgacctg 


gacaccaaca 


1020 


acagcagaag 


attttcgacc 


attgctcatg tcttggtggc cggataagaa 


tacgeaggta 


1080 


gcttatctac 


aatatatgca 


atcagttggt atgctacctg acgatgttaa 


agtatcaaat 


1140 


gatgataata 


tgagcacatt 


gactgatgct gctatgactg ttcaaaagaa 


tatcgaatcg 


1200 


cgaattggtg 


tatctggaaa 


aactgattgg ctcaagcaag atatgaacaa 


actgattgat 


1260 


tcacaggcaa 


attggaatat 


tgatagtgaa tcaaagggta atgatcattt 


acagggtggg 


1320 


gcattgttat 


atgtgaatga 


tgacaaaaca cctaacgcga actcagatta 


ccgtctgtta 


1380 


aaccgtacac 


caaccaacca 


aaccggccaa attactgatc caagtaaaca 


aggtggatat 


1440 


gagatgttat 


tagctaatga 


tgttgataat tctaaccctg ttgtacaagc 


tgagcaattg 


1500 


aactggcttc 


actacatgat 


gaacattggt actatagctc agaacgaccc 


aacagctaat 


1560 


tttgacggtt 


atcgtgttga 


tgcggttgat aacgttgatg ccgatctctt 


acaaattget 


1620 


ggtgattact 


ttaaagctgc 


atacggtact ggtaaaactg aggcaaacgc 


aaacaatcat 


1680 


atttcgatct 


tggaagattg 


ggataataat gattctgcgt acattaaagc 


ecaegggaat 


1740 


aaccaattga 


caatggattt 


tccagcacac ttggctttga aatacgcctt 


gaacatgect 


1800 


cttgccgcac 


aaagtggcct 


agaaccgcta attaatacaa gtcttgttaa 


gcgtgggaaa 


1860 


gatgccacag 


aaaatgaagc 


acaaccaaac tatgccttta tccgtgccca 


tgatagtgaa 


1920 


gtgcagaccg 


ttattgcaca 


aattattaag gataaaatta acacaaaatc 


agaeggctta 


1980 


actgtaacac 


cagatgagat 


taagcaagct ttcactattt acaacgccga 


tgaattaaaa 


2040 


gcagataagg 


aatatacagc 


atacaatatt cctgcttctt acgctgtatt 


gttgacaaac 


2100 


aaggatactg 


tgccacgtgt 


ttattatggt gatctatttt ctgatgatgg 


acagtatatg 


2160 


tcacagaagt 


caccatacta 


tgacgccatt acgtcacttt tgaaaagccg 


tatcaaatat 


2220 


gttgctggtg 


gtcaaagtat 


gaatatgacg tacttgcatg agtgctttga 


tccagcaaaa 


2280 


aatgagacaa 


agccacaagg 


tgtcttaaca tcagtacgtt acggtaaagg 


tgcgatgacg 


2340 


gctgacgatt 


tgggtaatag 


tgacacacgt caacaaggta ttggtttggt 


gattaataat 


2400 


aagccattcc 


tgaatttaaa 


tgatgatgaa caaattgtgc tcaatatggg 


tgctgctcac 


2460 


aaaaatcaag 


cttaccgacc 


acttatgttg acaacaaaat ctggtcttca 


aatttacgat 


2520 


aaggatgccg 


gagcgccagt 


tgtttatact aacgatgctg gtcaacttat 


ttttaagtca 


2580 


gatatggtct 


atggtgtcag 


caatccacag gtatctggtt attttgctgc 


atgggtacca 


2640 


gtcggtgcga 


gtgatagtea 


agatgctaga acacaaagca gccagtcaga 


aactaaggat 


2700 


ggcgatgtct 


atcacccaaa 


tgctgcgctt gattctaatg tgatttatga 


aggcttctcg 


2760 


aattcccaag 


caatgcctga 


aaagaatgat gacttcacca acgtaaaaat 


tgctcaaaat 


2820 


gctaaactgt 


ttaaagattt 


agggattaca agctttgaat tagcaccgca 


atategttea 


2880 
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agtacagata 


atagtttttt 


ggattcggtt 


atccaaaacg 


getatgeett 


tactgatcga 


2940 


tatgatgttg 


gctataatac 


gecaacaaaa 


tatggtacag 


ttgatcaact 


tctagatagt 


3000 


ctaagagcat 


tacacgcaca 


aggtattcag 


gctattaatg 


actgggtacc 


tgatcaaatt 


3060 


tataatttac 


ctggcgaaca 


aatcgtcacc 


g cagt teg t a 


caaatggttc 


aggtaagtac 


3120 


gattatgatt 


cagtga tt a a 


taacacgctc 


tatgattcac 


gaacagttgg 


gggcggcgaa 


3180 


taccaagaaa 


aatttaatqq 


cctgttctta 


gaccagttga 


aaaaagatta 


tcctagcttg 


3240 


tttgaaacta 


age aga t a c c 


aacgaatcag 


ccgatgaatc 


cggatgttaa 


aattaaagaa 


3300 


tggtctgcaa 


agt act tt a a 


tggttcaaac 


attcaaggtc 


gtggcgcttg 


gtatgtactt 


3360 


aaaaactaaa 


caacaaat ca 


atatttcaat 


gtgtctaqtq 


ataatggatt 


ettgectaaa 


3420 


cagt tactgg 


gtgaaaaaac 


aagcaccggc 


tttataacag 


aaaatggtaa 


gacttctttc 


3480 


tactcaacaa 


gtggttatca 


agctaaagat 


acctttattc 


aagatggaac 


aaattggtat 


3540 


tactttgata 


atgeaggcta 


tatgttgaca 


ggtaaacaaa 


atatccacga 


taaaaattat 


3600 


tatttcttac 


ctaatggtgt 


ggaacttcaa 


gatgettace 


tttttgaegg 


taatcaagaa 


3660 


ttttactata 


ataaagctgg 


ggaacaagtt 


atgaaccagt 


attatcaaga 


tagtcaaaat 


3720 


caatggcatt 


atttctttga 


aaatggtcgc 


atggcaattg 


gectgacaga 


agttccgaac 


3780 


qctqatqqca 


cccatgttac 


acaatatttt 


gatgetaatg 


gtgtccaaac 


taaaggcaca 


3840 


gctataaaag 


atcagaataa 


teaattaege 


tattttgatg 


aggecacagg 


taatatggtg 


3900 


gttaat teat 


qqqqacaqt t 

3333 3 


agcagataag 


tcttggcttc 


accttaatgc 


acaaggegtt 


3960 


qctgtgactg 


gtaaccaaaa 


aattgatggt 


gaagagtact 


acttcaatgc 


tgatggtaag 


4020 


caagttaaag 


geaatgeaat 


catcgataat 


aatggtgatc 


aacgttatta 


tgatggtgat 


4060 


aaaaatatca 


tggtagttaa 


ttcatqqqgt 


qagttgccaq 


atggctcatg 


gttatatttg 


4140 


aatgacaaag 


gtattgctgt 


aacaggcegt 


caagtcatta 


ataatcaagt 


taatttcttt 


4200 


aataataata 


gtaagcaaat 


caaagatgee 


tttaaattat 


tatccgatgg 


ttcatgggtg 


4260 


tatttggatg 


ataagggect 


gataacaact 


ggagccaaag 


ttatcaatgg 


tctaaatatg 


4320 


ttttttgata 


aagaeggtea 


tcaaatcaaa 


ggtgatgcca 


geaeggatge 


caatggtaag 


4380 


cgecattatt 


atgacaaaaa 


tgatggtcat 


cttgtcacaa 


attcatgggg 


tgagttgcca 


4440 


gatggttcat 


ggttatatct 


agaagaacaa 


ggtgatgctg 


ttactggtca 


acgtgtgatt 


4500 


gatggcaaga 


caegctattt 


tgatgaagat 


ggcaaacaaa 


ttaaaaatag 


cctaaaaacg 


4560 


ctggccaatg 


gcgataagat 


ttatcttgat 


ggtgatgggg 


ttgccgcaac 


aggcttacaa 


4620 


catgtgggcg 


ataaaatcat 


gcat ttugai. 


gaagatggca 


aacaagttgt 


tggcaagtt t 




gtaccagcaa 


aagatggttc 


aeggtattae 


ttaaatcagg 


atggtgttgc 


cgcggttggt 


4740 


ccaagcagca 


ttaatggaca 


atcactttac 


tttgatcaag 


atggtaaaca 


agttaaatat 


4800 


aacgaagtcc 


gtaatagtga 


cggaacaacc 


aactattaca 


caggattaac 


gggtgaaaag 


4860 


ttaacgcaag 


acttcggtga 


actaccagat 


ggttcatgga 


tttatcttga 


tgcgcaaggt 


4920 


cacacagtaa 


ctggtgcaca 


aatcattaac 


ggtcaaaatc 


tttactttaa 


ggctgaegge 


4980 


cagcaagtta 


aaggtcatgc 


ttatactgac 


caattaggtc 


atatgcgttt 


ttatgatcct 


5040 


gattcaggtg 


atatgttgag 


taatcgcttt 


gaacaaatca 


cacctggtgt 


atgggcttac 


5100 


tttggtgctg 


atggtgtggc 


cataactgga 


caacatgaca 


taaatggtca 


gaagctattc 


5160 


tttgatgaga 


caggatatca 


agttaaaggt 


tegcaaegta 


caatagatgg 


tacgttatac 


5220 
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agcctcgatc 


cccaaactgg 


taaccaaaaa 


cgcgtacaga 


caacatcgtc 


gccacaagca 


5280 


ggtcactata 


tcacgaaaaa 


tggtaacgat 


tggcagtatg 


ataccaatgg 


tgaactagcg 


5340 


aagggtctgc 


gtcaagatag 


caacggtaag 


ttgcgttacc 


ctgatctgac 


aaccggcata 


5400 


caagcgaaag 


gccaatttgt 


tacaattggc 


caagaaactc 


attactttag 


taaagatcac 


5460 


ggggatgcgc 


agttattgcc 


aatggtcact 


gaagggcatt 


acggtacaac 


aacactcaag 


5520 


caaggtcaag 


acaccaaaac 


agcctgggtt 


taccgtgatc 


aaaabaatac 


tattttgaag 


SSBO 


ggaccgcaaa 


atatcaatgg 


cacgctgcaa 


ttctttgatc 


catatacagg 


tgaacaactt 


5640 


aagggtggcg 


tagcaaagta 


tgacgacaag 


ctcttttact 


ttgaatcagg 


taaaggtaat 


5700 


cttgttagca 


ccgtagcagg 


tgactatcag 


gatggtcatt 


atatttccca 


agatggccaa 


5760 


acacgttacg 


cagataagca 


aaatcagctt 


gtaaagggac 


ttgttactgt 


taatggggca 


5820 


ttacaatact. 


ttgataacgc 


tactggtaac 


caaataaaaa 


atcaacaagt 


tattgttgat 


5880 


ggcaagacgt 


actattttga 


cgataaaggc 


aatggtgaat 


acttattcac 


taatacatta 


5940 


gatatgtcta 


ctaatgcttt 


ccctaccaaa 


aatgttgcat 


tcaatcatga 


cagtagcagt 


6000 


ttcgaccata 


ctgttgatgg 


cttcttgacg 


gcagatactt 


ggtatcgacc 


aaagtcaatt 


6060 


ttggctaacg 


ggacaacttg 


gcgtgattcg 


actgataagg 


atatgcgacc 


attaatcact 


6120 


gtttggtggc 


caaataagaa 


tgttcaagtc 


aactacctca 


acttcatgaa 


agcaaatggc 


6180 


ttgttgacaa 


cagcagcaca 


atacacacta 


cattcagatc 


aatatgattt 


gaaccaagct 


6240 


gcacaagatg 


ttcaagtggc 


cattgaaagg 


cgcattgcgt 


cagagcatgg 


cacagactgg 


6300 


ttacagaaat 


tgttgtttga 


atcacaaaat 


aataacccat 


catttgtgaa 


gcaacaattc 


6360 


atttggaaca 


aggattctga 


atatcatggt 


ggtggtgatg 


cttggttcca 


aggtggttat 


6420 


ctgaagtatg 


gcaataaccc 


actcacacca 


acaactaatt 


ctgattatcg 


tcaacctggt 


6480 


aacgcatttg 


atttcttgct 


agccaacgac 


gtggataatt 


ctaatcctgt 


tgtgcaagct 


6540 


gaaaacttaa 


accggctaca 


ctacctaatg 


aactttggca 


ccatcactgc 


gggtcaagat 


6600 


gacgctaatt 


ttgatagtat 


tcgtattgac 


gctgtcgact 


ttattcataa 


tgatacaatc 


6660 


caacgtactt 


atgattatct 


tcgtgatgct 


tatcaagtgc 


aacaaagtga 


agccaaagca 


6720 


aaccagcaca 


tttcactggc 


cgaagctggc 


ttagacgcag 


gtacatcaac 


gattcataat 


6780 


gacgcgtcaa 


ttgagtcaaa 


cctccgtgaa 


gcagcgacat 


tgtcgttaac 


aaatgaacct 


6840 


ggtaaaaata 


aaccattgac 


gaatatgcta 


caagacgttg 


acggcggtac 


gcttatcacc 


6900 


gaccatacgc 


agaatagtac 


agaaaatcag 


gcgacaccaa 


actattcaat 


tattcacgcg 


6960 


cacgataaag 


gtgtgcaaga 


aaaagtaggt 


gcagccatta 


ctgatgctac 


tggtgctgat 


7020 


tggacgaact 


ttacagatga 


acagctaaaa 


gccggattag 


agctattcca 


taaggatcag 


7080 


cgcgcaacaa 


acaaaaagta 


taatagttat 


aacataccaa 


gtatttatgc 


cctgatgttg 


7140 


acaaacaaag 


atactgttcc 


tcgtacgtac 


tatggggata 


tgtaccaaga 


tgacggacag 


7200 


tatatggcaa 


acaagagtat 


ctactatgat 


gccttagtgt 


cattaatgac 


ggctcgtaaa 


7260 


agctatgtca 


gcggtggtca 


aactatgagt 


gttgacaatc 


atggcttgct 


gaagagtgtc 


7320 


cgttttggaa 


aagatgcgat 


gacagctaat 


gatttaggta 


catcagctac 


gcgcactgag 


7380 


ggtcctggcg 


tcattattgg 


taatgatcca 


aagttgcaac 


ttaatgattc 


ggataaagtg 


7440 


acactggata 


tgggtgcagc 


acataaaaat 


caaaagcatc 


gcgcagtcat 


cttaacaaca 


7500 


cgtgatggtt 


tggcaacctt 


taattcagat 


caagcaccaa 


cagcctggac 


aaacgatcaa 


7560 


ggaacgttaa 


cattctcaaa 


tcaagagaet 


aacgggcaag 


acaacacaca 


aatccgtggt 


7620 
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gttgctaatc 


cgcaagtttc 


tggttatcta 


qctqtttqqq 

3 3 333 


tgcctgtggg tgeatcagae 


7680 


aatcaagatg 


cccgtacagc 


agcaacgaca 


acagaaaatc 


atgatggtaa agtattacac 


7740 


t ca aatacaa 
y ^? y y 


cat tagattc 


t aacc ttatt 


tetgaaggt t 


t ctctaact t ccaacctaag 


7800 


gcaacaacgc 


atgatgaact 


t acgaacgt t 


gtaat tgc t a 


aaaatgeega tgtcttcaat 


7860 


aat tggggta 


ttacgagttt 


tgaaa tggca 


ccacagt acc 


gttcaagtgg ggaccat aca 


7920 


ttcttggatt 


caacgattga 


taatggttat 


gcctccactg 


ategctatga cttaggtttc 


7980 


aatacaccaa 


caaagtatgg 


cactgatggt 


gatttgcgtg 


caacgattca agegctacat 


8040 


catgctaata 


tgcaagttat 


ggctgacgtt 


gttgataacc 


aggtctataa cttacctggt 


8100 


aaagaagttg 


tttcagcaac 


acgagcaggt 


gtttatggta 


atgacgaege caegggcttt 


8160 


ggaacgcaac 


tctatgtgac 


taactccgct 


ggtggtggtc 


aataccaa^a (jaa<tLat(ji.L 


8220 


ggacaatact 


tagaagctct 


gaaagcaaag 


tatccagacc 


t ct t tgaggg taaggectat 


8280 


gattattggt 


ataagaacta 


tgcaaatgat 


gggtcaaacc 


cttactatac attgtcacac 


8340 


ggtgaccgtg 


aatctatccc 


agcagatgtt 


gctattaagc 


aat ggt cage taagtatatg 


8400 


aacggcacga 


acgttttggg 


caatggtatg 


ggttatgtat 


tgaaggattg gcataatggt 


8460 


caatatttca 


agcttgatgg 


tgataaatca 


acattacctc 


aaattt 


8506 


<210> SEQ ID NO 5 
<211> LENGTH: 8931 
<212> TYPB: DNA 

<213> ORGANISM: Leuconostoc mesent eroides 
<220> PEATURE: 

<223> OTHER INFORMATION : Gene dor-E 






<400> SEQUENCE: 5 










aataatctgt 


ctccattgct 


ttcaaaataa 


taatagttaa 


ttattatcat ggaacaatca 


60 


atattttatt 


tatattcact 


attgaatatc 


cttttttgca 


taaatctcta gagecgattt 


120 


tttgggttat 


acaatgaatt 


ggtaaaggct 


aatcattttt 


acaaaacca t ggtggtt ttt 


180 


tattttttcc 


aaaattaccg 


aactagagga 


agagaaaagg 


agcaatagt t gtatgagaga 


240 


catgagggta 


atttgtgacc 


gtaaaaaatt 


gtacaaatcg 


ggcaaagtac tagtaacagc 


300 


cggtattttt 


gctttgatga 


tgtttggcgt 


cacaactgct 


agtgt tagtg caaa t acga t 


360 


tgcagttgac 


acgaatcata 


gccgtacttc 


age ac agate 


aataagagtg ccgttgataa 


420 


ggttaatgat 


gacaagacta 


ctttaggagc 


ggcaaaagta 


gtggcagtag ccacaacgcc 


480 


agcgacaccg 


gtagcagaca 


aaacagtaag 


tgcacccgca 


gcagataagg cagtagatac 


54 0 


aacgtcatca 


acgacacctg 


caacggataa 


ggcagtagat 


acaacgccaa cgacacctgc 


600 


agcagataag 


gcagtagata 


caacgccaac 


gacacctgca 


gcagataagg cagtagatac 


660 


aacgccaacg 


acacctgcag 


caaataaagc 


agtagataca 


acgccagcga ccgctgcaac 


72 0 


agataaggcg 


gtagccacgc 


cagccacacc 


tgcagcagat 


aagctagcaa atacgacgcc 


780 


tgcaacggac 


aaggcagtag 


ccacaacgcc 


agcgacgccg 


gtagcaaata aagcagcaga 


840 


cacgagtagt 


attcatgatc 


aaccattaga 


tacaaatgtg 


ccaactgata aatcagcaaa 


900 


cctcgtctcg 


acaacacaaa 


aaagtacgga 


taatcaacaa 


gttaagtcta cagaaacatc 


960 


tcatcttcaa 


gaaatcaacg 


gtaaaaccta 


ttttcttgac 


gacaatggtc aagttaaaaa 


1020 


gaacttcacc 


gcCattattg 


acggtaaagt 


tctatacttt 


gataaaacat ceggegaatt 


1080 


gaccgcaaac 


gcaccgcaag 


ttactaaggg 


attagtaaat 


attgataatg cacataacgc 


1140 
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ggctcatgat 


ctcacagctg 


ataactccac 


aaatgtcgac 


ggttacctaa 


cagctaacag 


1200 


ttggtatcgt 


cctaaggaca 


tcttaaaaaa 


cggaacgacc 


tggacaccaa 


caacagcaga 


1260 


agattttcga 


ccattgctca 


tgtcttggtg 


geeggataag 


aataegcagg 


tagcttatct 


1320 


acaatatatg 


caatcagttg 


gtatgetace 


tgacgatgtt 


aaagtatcaa 


atgatgataa 


1360 


tatgagcaca 


ttgactgatg 


ctgetatgae 


tgttcaaaag 


aatatcgaat 


cgcgaattgg 


1440 


tgtatctgga 


aaaactgatt 


ggctcaagca 


agatatgaac 


aaactgattg 


attcacaggc 


1500 


aaattggaat 


attgatagtg 


aaccaaaggg 


taatgatcat 


ttacagggtg 


gggcattgtt 


1560 


atatgtgaat 


gatgacaaaa 


cacctaacgc 


gaactcagat 


tacegtctgt 


taaacegtae 


1620 


accaaccaac 


caaaccggcc 


aaattactga 


tccaagtaaa 


caaggtggat 


atgagatget 


1680 


attagctaat 


gatgttgata 


attctaaccc 


tgttgtacaa 


gctgagcaat 


tgaactggct 


1740 


tcactacatg 


atgaacattg 


gtaccatagc 


tcagaacgac 


ccaacagcta 


attttgaegg 


1800 


ttatcgtgtt 


gatgcggttg 


ataacgttga 


tgccgatctc 


ttacaaattg 


ctggtgatta 


1860 


cttcaaagct 


geataeggta 


ctggtaaaac 


tgaggcaaac 


gcaaacaatc 


atatttcgat 


1920 


cttggaagat 


tgggataata 


atgattctgc 


gtacattaaa 


gcccacggga 


ataaccaatt 


1980 


gacaatggat 


tttccagcac 


acttggcttt 


gaaatacgee 


ttgaacatgc 


ctcttgccgc 


2040 


acaaagtggc 


ctagaaccgc 


taattaatac 


aagtcttgtt 


aagcgtggga 


aagatgecac 


2100 


agaaaatgaa 


gcacaaccaa 


actatgeett 


tatccgtgcc 


catgatagtg 


aagtgcagac 


2160 


cgttattgca 


caaattatta 


aggataaaat 


taacacaaaa 


teagaegget 


taactgtaac 


2220 


accagatgag 


attaagcaag 


ctttcactat 


ttacaacgcc 


gatgaattaa 


aagcagataa 


2280 


ggaatataca 


gcatacaata 


ttcctgcttc 


ttacgctgta 


ttgttgacaa 


acaaggatac 


2340 


tgtgccacgt 


gtttattatg 


gtgatctatc 


ttctgatgat 


ggacagtata 


tgtcacagaa 


2400 


gtcaccatac 


tatgacgeca 


ttaegtcact 


tttgaaaagc 


cgtatcaaat 


atgttgctgg 


2460 


tggtcaaagt 


atgaatatga 


egtacttgea 


tgagtgcttt 


gatccagcaa 


aaaatgagac 


2520 


aaagccacaa 


ggtgtcttaa 


catcagtacg 


ttacggtaaa 


ggtgcgatga 


eggctgaega 


2580 


tttgggtaat 


agtgacacac 


gtcaacaagg 


tattggtttg 


gtgattaata 


ataagecate 


2640 


cttgaattta 


aatgatgatg 


aacaaattgt 


gctcaatatg 


ggtgctgctc 


acaaaaatca 


2700 


agcttaccga 


ccacttatgt 


tgacaacaaa 


atctggtctt 


caaatttacg 


ataaggatgc 


2760 


cggagcgcca 


gttgtttata 


etaacgatge 


tggtcaactt 


atttccaagt 


cagacatggc 


2820 


ctatggtgtc 


agcaatccac 


aggtatctgg 


ttattttget 


gcatgggtac 


cagtcggtgc 


2880 


gagtgatagt 


caagaegcta 


gaacacaaag 


cagccagtca 


gaaactaagg 


atggcgatgt 


2940 


ctatcattca 


aatgetgege 


ttgactctaa 


tgtgatttat 


gaaggcttct 


cgaatttcca 


3000 


ageaatgect 


gaaaagaatg 


atgacttcac 


caaegtaaaa 


attgetcaaa 


atgetaaatt 


3060 


gtttaaagat 


ttagggatta 


caagctttga 


attagcaccg 


caacatcgct 


caagtacaga 


3120 


taatagtttt 


ttggattcgg 


ttatccaaaa 


cggctatgcc 


tttactgatc 


gatatgatgt 


3180 


tggctataat 


acgccaacaa 


aatatggtac 


agttgatcaa 


cttctagata 


gtctaagagc 


3240 


attacacgea 


caaggtattc 


aggctattaa 


tgactgggta 


cccgatcaaa 


tttataattt 


3300 


acccggcgaa 


caaategtea 


ccgcagttcg 


tacaaatggt 


tcaggcaagt 


acgactatga 


3360 


ttcagtgatt 


aacaacacgc 


tccacgatcc 


acgaacagct 


gggggcggcg 


aataccaaga 


3420 


aaagtttggt 


ggcctgtcct 


tagaccagct 


gaaaaaagac 


taccccagct 


tgtttgaaac 


3480 


taagcagaca 


tcaacgaacc 


agecgatgaa 


tccggatgtt 


aaaattaaag 


aatggtctgc 


3540 
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aaagtact t t 


aatggttcaa 


acactcaagg 


tcqtqqcqct 


tggtatgtac 


ttaaagactg 


3600 




caa tat tt ca 


atgtgtctag 


tgataatgga 


ttc tt gect a 


aacagttact 


3660 


aaataaaaaa 


acaagcaccg 


get ttataac 


agaaaatggt 


aagacttct t 


tctactcaac 


3720 




caagct aaag 


a tacc tttat 


tcaagatgga 


aca aat tggt 


attactttga 


37S0 


^ J 1 J ^y y 


t at atg ttga 


caggt aaaca 


aaa tatccac 


gat aaaaatt 


attatttctt 


3840 


<A<-C tod i_yy u 




a a« a t~ ctcT f f* 
aayatyvtua 


cctttttgat 


aat aatcaaa 
yy uaa 


aattttacta 


3900 


LaaLau aye L 


gS99 aacaa y 


t ta t gaacca 


gtattat caa 


gat agt caa a 


at caa tggca 


3960 


ttflttLCLll 


yaaoaLyy t.v» 


gca tggcaa t 


tggee tgaca 


gaagt tccga 


acacCaataa 

avyv,uyauyy 


4 020 


caccca tgt t 


ata^aci Ur c* i, l. 


t tgacget aa 


tggtgtccaa 


att aaaggca 


cagctataaa 


4080 


agat cagaat 




acT a £ tttoa 
y^vawuvwya 


taaaaccaca 


ggt aatatgg 


tggttaattc 


4140 


ataaaoacaa 
a ^-yy yy *»v»«y 


ttagcagata 


agtct tggct 


ttaccttaat 


qcacaaqqcq 


ttgctgtgac 


4200 


tggc aaccaa 


a aa at tgatg 


gtgaagag ta 


ctacttcaat 


qctqatqqta 


agcaagttaa 


4260 


aggcaatgca 


atcatcgata 


ataatggtga 


teaaegttat 


tatqatgqtq 

3 33 w 3 


ataagggtgt 


4320 




aat tcatggg 


a ta aat tacc 
y ^y^y kL 3 Lk ' 


agatggctca 


tggttatatt 


tgaatgacaa 


4380 


aggt at tgc t 


gtaacaggee 


gtcaagtcat 


taataatcaa 


gtcaatttcc 


ttggtaatga 


4440 


t- nrif a anr^A 
t-yy !• <itiy\„<»«i 




cct ttaaatt 


at tatccgat 


aat teat aaa 


tgtat ttgga 


4B00 


tgat aagggc 


ctgataacaa 


c tggagccaa 


agt tatcaat 


ggt ctaaata 


tgttttttga 


4560 


taaaaacaat 


cat caa at ca 


aaaataatac 
a **y y *- y ** *- y 


cagcaeggat 


gccaatggta 


agegecatta 


4620 


t" raraacaaa 




ate t t gt cac 


aaattcatgg 


aataaat tac 
y y *" y a y ** y w 


cagatggttc 


4680 


d i»yy cLauoL 


ctagaagaac 




tgt t actggt 


caacgtgtga 


t tgatggcaa 


4740 




l. LLyawyooy 


aeggcaaaca 


aat t aaa aat 


agectaaaaa 


cgc tggccaa 


4800 


*33^y ^ y 


atttatct tg 


atggt gatgg 


aat tact aca 
yy *• L y y 


acaggct tac 


aacatgtggg 


4860 


tyaL del dd 




ataaaoatao 


caaacaagt t 


attaacaaat 


t tgt at cage 


4920 


aaaaga tggt 


t catggtat t 


act t a aat c a 


aaa t aat at t 
yy***-yy | -y 1 - 1 - 


accacaatta 
y *-*-y«-yy *- «-y 


gtccaagcag 


4980 


catt aatgga 


caa tea ct tt 


act ttgatca 


aga tggtaaa 


caagttaaat 


ataatgaagt 


5040 


t cat aataat 


gatggaacaa 


ccaactatta 


cacaggatta 


acqqqtqaaa 

333 3 


agt taacgea 


5100 


agac t teggt 


gaactaccag 


atggttcatg 


gatttatctt 


qatqeqeaaq 

3 3 3 3 


gtcatacagt 


5160 


aactgg tgea 


caaatcat ta 


aeggtcaaaa 


tctttacttt 


aaqqctqacq 

3J -J Ij 


gecagcaagt 


5220 


f aaana teat 


get tat ac tg 


accaattagg 


teatatgegt 


ttttatgatc 


c tgattcagg 


5280 


tgatatgtcg 


agtaatcget 


ttgaacaaat 


cacacctggt 


gtatgggctt 


actttggtgc 


534 0 


tgatggtgcg 


gecataactg 


gacaacatga 


cataaatggt 


cagaagctat 


cctttgatga 


5400 


gacaggatac 


caagttaaag 


gttcgcaacg 


tacaatagat 


ggtacgttat 


acagcttcga 


5460 


ttctcaaact 


ggtaaccaaa 


aacgegtaca 


gacaacattg 


ttgccacaag 


caggtcacta 


5520 


tatcacgaaa 


aatggtaacg 


attggcagta 


tgataccaat 


ggtgaactag 


cgaagggtct 


5580 


gegtcaagat 


agcaatggta 


agetgegtta 


ctttgatttg 


acaaccggca 


tacaagegaa 


5640 


aggceaactt 


gttacaattg 


gecaagaaac 


ttattacttt 


agtaaagatc 


aeggggatge 


5700 


gcagttattg 


ccaatggtca 


ctgaagggca 


ttaeggtaca 


ataacactca 


agcaaggtca 


5760 


agacaccaaa 


acagcctggg 


tttaccgtga 


tcaaaataat 


actattttga 


agggattgea 


5820 


aaataccaat 


ggcacgttgc 


aattctttga 


tccatataca 


ggtgaacaac 


ttaagggtgg 


5880 
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cgtagcaaag tacgacgaca agctccttta ctttgaatca ggtaaaggta atcttgttag 5940 

caccgtagca ggtgactatc aggatggtca ttatatttcc caagatggcc aaacacgtta 6000 

cgcagataag caaaatcagc ttgtaaaggg acttgttact gttaatgggg cattacaata 6060 

ctttgataac gctactggta accaaataaa aaatcaacaa gttattgttg atggcaagac 6120 

gtactatttt gacgataaag gcaatggtga atacttattc actaatacat tagatatgtc 6180 

tactaatgct ttttctacca aaaatgttgc attcaatcat gacagtagca gtttcgacca 624 0 

tactgttgat ggcttcttga cggcagatac ttggtatcga ccaaagtcaa ttttggctaa 6300 

cgggacaact tggcgtgatt cgactgataa ggatatgcga ccattaatca ctgtttggtg 6360 

gccaaataag aatgttcaag tcaactacct caacttcatg aaagcaaatg gcttgttgac 6420 

aacagcagca caatacacac tacattcaga tcaaeatgat ttgaaccaag ctgcacaaga 6480 

tgttcaagtg gccatcgaaa ggcgcattgc gtcagagcat ggcacagacc ggttacagaa 6540 

attgttgttt gaatcacaaa ataataaccc atcatttgtg aagcaacaat tcatttggaa 6600 

caaggattct gaatatcatg gtggtggtga tgcttggttc caaggtggtt atctgaagta 6660 

tggcaataac ccactcacac caacaactaa ttctgattat cgtcaacctg gtaatgcatt 6720 

tgatttcttg ctagccaacg acgtggataa ttctaatcct gttgtgcaag ctgaaaactt 6780 

aaactggtta cattacttaa tgaactttgg caeca tcact gcgggtcaag atgacgctaa 6840 

ttttgatagt attegtattg aegctgtega ctttattcat aatgatacaa tccaacgtac 6900 

ttatgattat cttcgtgatg cttatcaagt gcaacaaagt gaagccaaag caaaccagca 6960 

catttcattg gttgaagctg gettagaege aggtacatca acgattcata atgatgcgtt 7020 

aattgagtca aacctccgtg aagcagegae attgtcgtta acaaatgaac ctggtaaaaa 7080 

taaaccattg aegaatatge tacaagacgt tgacggcggt aegcttatea ccgaccatac 7140 

gcagaatagt acagaaaatc aggcgacacc aaactattca attattcacg cgcacgataa 7200 

aggtgtgcaa gaaaaagtag gtgeagecat tactgatget actggtgctg attggacgaa 7260 

ctttacagat gaacagttaa aagceggatt agagctattc tataaggatc agcgcgcaac 7320 

aaacaaaaag tataatagtt ataacatacc aagtatttat gecctgatgt tgacaaacaa 7380 

agatactgtt ccccgtacgt attatgggga tatgtatcaa gatgaeggae agtatatggc 744 0 

aaacaagagt atctactacg atgccttagt gtcattaatg aeggctegta aaagctatgt 7500 

cagcggtggt caaaccatga gtgttgacaa tcatggtttg ttgaagagtg tccgttttgg 7560 

aaaagatgcg atgacagcta atgatttagg cacatcagcc aegegtactg agggtcttgg 7620 

tgtcattatt ggtaacgatc caaagttgea actcaatgat teggacaaag tgacactgga 7680 

tatgggtgca gcacataaaa atcaaaagta tegegcagtt atcttaacaa cacgtgatgg 7740 

ttcggcaacc tttaattcag accaagcacc aacagcttgg acaaacgacc aaggaacgtt 7800 

aacattctca aatcaagaga ttaaegggea agacaataca caaattegtg gtgttgctaa 7860 

tccgcaagct tccggctatc tagctgtttg ggtgcctgtg ggtgcaccag acaatcaaga 7920 

tgcccgtaca gcagcaacga caacagaaaa tcatgatggt aaagtattac actcgaacgc 7980 

ggcattagat tccaacccca tttacgaagg tctctctaac ccccaaccca aggcaacaac 8040 

gcatgatgaa cttacgaacg ttgtaattgc taaaaatgee gatgtcttca ataattgggg 8100 

tattacgagc ettgaaaegg caccacagta ccgctcaagc ggggaccata cattcttgga 8160 

ttcaacgatt gataatggtt atgccttcac tgategctat gacttaggtt tcaatacacc 8220 

aacaaagtat ggcactgatg gtgatttgcg tgcaacgact caagcgctac ateatgetaa 8280 
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tatgcaagtt 


atggctgacg 


ttgctgataa 


ccaggtctat 


aacttacctg 


gtaaagaagt 


8340 


tgtt tcagca 


acacgagcag 


ytgcc catgy 


taatgacgac gccacgggct 


t tggaacgca 


8400 


actctatgtg 


actaactccg 


ttggtggtgg 


tcaataccaa 


gagaaatatg 


ctggacaata 


8460 


cttagaagct 


ctgaaagcaa 


agtatccaga 


cctctttgag 


ggtaaggcct 


atgattattg 


8520 


gtataagaac 


tatgcaaatg 


atgggtcaaa 


tccttactat 


acattgtcac 


acggtgaccg 


8580 


tgaatctatc 


ccagcagatg 


ttgctattaa 


gcaatggcca 


gctaagtata 


tgaacggcac 


8640 


gaacgctttg 


ggcaatggta 


tgggttatgt 


attgaaggat 


tggcataacg 


gtcaatattt 


8700 


caagcttgat 


ggtgataaat 


caacattacc 


tcaaatttaa 


tttattttga 


tagggaacga 


8760 


ttatcttatc 


aaattgtagt 


gacaaaagcc 


gcagatattg 


aatccaatat 


ctgcgactcc 


8820 


tcgtctgtaa 


agctatgcta 


taataacgtt 


atgacaaaag 


aaaattattt 


taaagttggc 


8880 


acaattgtca 


acacccacgg 


tattcgtggc 


gaagtgaaga ttatggatat 


c 


8931 



<210> SEQ ID NO 6 
<211> LENGTH: 17 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 6 

Ala Asn Trp Asn lie Asp Ser Glu Ser Lys Gly Asm Asp His Leu Gin 
15 10 IS 

Gly 



<210> SEQ ID NO 7 
<211> LENGTH: 24 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 7 

Gly Gly Tyr Glu Met Leu Leu Ala Asn Asp Val Asp Asn Ser Asn Pro 
15 10 15 

Val -Val Gin Ala Glu Gin Leu Aon 
20 



<210> SEQ ID NO 8 
<211> LENGTH: 21 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 8 

Ala Asn Phe Asp Gly Tyr Arg Val Asp Ala Val Asp Asn Val Asp Ala 
15 10 15 

Asp Leu Leu Gin lie 
20 



<210> SEQ ID NO 9 

<211> LENGTH: 12 

<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 



<400> SEQUENCE : 9 



His lie Ser lie Leu Glu Asp Trp Asp Asn Asn Asp 
IS 10 



<210> SEQ ID NO 10 
<211> LENGTH: IS 
<212> TYPE: PRT 
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<213> ORGANISM: Leuconostoc me sent ero ides 
<4 0O> SEQUENCE: 10 

Tyr Ala Phe He Arg Ala His Asp Ser Glu Val Gin Thr Val He 
15 10 15 

<210> SEQ ID NO 11 
<211> LENGTH: 8 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc raesenteroides 

<400> SEQUENCE: 11 

Asp Trp Val Pro Asp Gin He Tyr 
1 5 

<210> SEQ ID NO 12 
<211> LENGTH: 19 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc raesenteroides 
<400> SEQUENCE: 12 



Phe He Trp Asn Lys Asp Ser Glu Tyr His Gly Gly Gly Asp Ala Trp 
15 10 15 

Phe Gin Gly 



<210> SEQ ID NO 13 
<211> LENGTH : 24 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 13 

Asn Ala Phe Asp Phe Leu Leu Ala Asn Asp Val Asp Asn Ser Asn Pro 
15 10 15 

Val Val Gin Ala Glu Asn Leu Asn 
20 



<210> SEQ ID NO 14 
<211> LENGTH: 21 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 14 

Ala Asn Phe Asp Ser He Arg He Asp Ala Val Asp Phe He His Asn 
15 10 15 



Asp Thr He Gin Axg 
20 



<210> SEQ ID NO IS 
<211> LENGTH: 8 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 

<400> SEQUENCE: 15 

His He Ser Leu Val Glu Ala Gly 
1 5 



<210> SEQ ID NO IS 

<211> LENGTH: 15 

<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 



<400> SEQUENCE: IS 

Tyr Ser He He His Ala His Asp Lys Gly Val Gin Glu Lys Val 
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10 15 



<210> SEQ ID NO 17 
<211> LENGTH: 8 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 

<400> SEQUENCE: 17 

Asp Val Val Asp Asn Gin Val Tyr 
1 5 



<210> SEQ ID NO 18 

<211> LENGTH j 7 

<212> TYPE : PRT 

<213> ORGANISM: Leuconostoc mesenteroides 

<400> SEQUENCE: 18 

Phe Tyr Phe Glu Ser Gly LyB 
1 5 



<210> SEQ ID NO 19 

<211> LENGTH: 8 

<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 

<400> SEQUENCE: 19 

Phe Glu Ser Gin Asn Asn Asn Pro 
1 5 



<210> SEQ ID NO 20 

<211> LENGTH: 21 

<212> TYPE : DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION s Synthetic PCR primer sequence 

<400> SEQUENCE: 20 
ttytayttyg artcaggsaa r 



<210> SEQ ID NO 21 

<211> LENGTH: 21 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic PCR primer sequence 

<400> SEQUENCE: 21 

ttytayttyg aragcggsaa r 21 



<210> SEQ ID NO 22 
<211> LENGTH: 24 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic PCR primer sequence 
<400> SEQUENCE: 22 

kggrttrttr ttttgtgayt caaa 24 



<210> SEQ ID NO 23 
<211> LENGTH: 24 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION : Synthetic PCR primer sequence 
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<400> SEQUENCE: 23 

kggrttrttr ttttggctyt caaa 24 

<210> SEQ ID NO 24 
<211> LENGTH: 30 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic PCR primer sequence 
<400> SEQUENCE: 24 

ccctttacaa gctgattttg cttatctgcg 30 

<210> SEQ ID NO 25 
<211> LENGTH: 32 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic PCR primer sequence 
<400> SEQUENCE: 2 5 

gggtcaaatc cttactatac attgtcacac gg 32 



<210> SEQ ID NO 2 6 

<211> LENGTH: 45 

<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic PCR primer sequence 

<400> SEQUENCE: 2 6 

agttgtatga gagacatgag ggtaatttgt gaccgtaaaa aattg 45 



<210> SEQ ID NO 27 
<211> LENGTH: 46 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic PCR primer sequence 
<400> SEQUENCE: 27 

atttgaggta atgttgattt atcaccatca agcttgaata ttgacc 46 



<210> SEQ ID NO 28 
<211> LENGTH: 30 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> PEATURE: 

<223> OTHER INFORMATION: Synthetic PCR primer sequence 
<400> SEQUENCE: 28 

gccatggcaa atacgattgc agttgacacg 30 



<210> SEQ ID NO 29 
<211> LENGTH: 31 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic PCR primer sequence 
<400> SEQUENCE: 29 

gccatggacg gtaaaaccta ttttcttgac g 31 



<210> SEQ ID NO 30 
<211> LENGTH: 27 
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<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic PCR primer sequence 
<400> SEQUENCE: 30 

tccatgggtg aaaaaacaag caccggc 27 

<210> SEQ ID NO 31 
<211> LENGTH: 23 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION : Synthetic PCR primer sequence 
<400> SEQUENCE: 31 

accatggata tgtctactaa tgc 23 

<210> SEQ ID NO 32 
<211> LENGTH: 21 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic PCR primer sequence 
<400> SEQUENCE: 32 

taactgttta ggcaagaatc c 21 

<210> SEQ ID NO 33 
<211> LENGTH; 27 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> PEATURE: 

<223> OTHER INFORMATION: Synthetic PCR primer sequence 
<4O0> SEQUENCE: 33 

taatgtatta gtgaataagt attcacc 27 

<210> SEQ ID NO 34 
<211> LENGTH: 24 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE; 

<223> OTHER INFORMATION: Synthetic PCR primer sequence 
<4O0> SEQUENCE: 34 

aatttgaggt aatgttgatt tatc 24 



<210> SEQ ID NO 35 
<:211> LENGTH: 6 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Consensus sequence derived from Leuconostoc 
me sent ero ides 

<400> SEQUENCE: 35 

Phe He His Asn Asp Thr 
1 5 



<210> SEQ ID NO 36 
<211> LENGTH: 7 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> PEATURE: 

<223> OTHER INFORMATION: Consensus sequence derived from Leuconostoc 
me sent ero ides 
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<400> SEQUENCE: 36 

Lys Gly Val Gin Glu Lys Val 
1 5 



<210> SEQ ID NO 37 
<211> LENGTH : 7 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Consensus sequence derived from Leuconostoc 
mesenteroides 

<400> SEQUENCE: 37 

Asn Val Asp Ala Asp Leu Leu 
1 5 



<210> SEQ ID NO 3 8 

<211> LENGTH: 7 

<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION : Consensus sequence derived from Leuconostoc 
mesenteroides 

<400> SEQUENCE : 3 8 

Ser Glu Val Gin Thr Val He 
1 B 



<210> SEQ ID NO 39 

<211> LENGTH: 32 

<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Consensus sequence derived from Leuconostoc 

mesenteroides 

<220> FEATURE: 

<221> NAME/KEY: raisc_f eacure 

<222> LOCATION: (6) . . (6> 

<223> OTHER INFORMATION: Xaa can be any naturally occurring amino acid 

<220> FEATURE: 

<221> NAME/KEY: misc_feacure 

<222> LOCATION: (28) . . (28) 

<223> OTHER INFORMATION: Xaa can be any naturally occurring amino acid 

<400> SEQUENCE: 39 

Trp Trp Tyr Phe Asn Xaa Asp Gly Gin Ala Ala Thr Gly Leu Gin Thr 
1*5 10 15 



He Asp Gly Gin Thr Val Phe Asp Asp Asn Gly Xaa Gin Val Lys Gly 
20 25 30 



<210> SEQ ID NO 40 
<211> LENGTH: 48 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Consensus sequence derived from Leuconostoc 
mesenteroides 

<400> SEQUENCE: 40 

Val Asn Gly Lys Thr Tyr Tyr Phe Gly Ser Asp Gly Thr Ala Gin Thr 
15 10 IS 

Gin Ala Asn Pro Lys Gly Gin Thr Phe Lys Asp Gly Ser Gly Val Leu 
20 25 30 



Arg Phe Tyr Asn Leu Glu Gly Gin Tyr Val Ser Gly Ser Gly Trp Tyr 
35 40 45 
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<210> SEQ ID NO 41 
<211> LENGTH: 20 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Consensus sequence derived from Leuconostoc 
mesenteroides 

<400> SEQUENCE: 41 

Asp Gly Lys lie Tyr Phe Phe Asp Pro Asp Ser Gly Glu Val Val Lys 
15 10 IS 

Asn Arg Phe Val 
20 



<210> SEQ ID NO 42 
<211> LENGTH: 14 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 

<220> FEATURE: 

<223> OTHER INFORMATION: Consensus sequence derived from Leuconostoc 
mesenteroides 

<400> SEQUENCE: 4 2 

Gly Gly Val Val Lys Asn Ala Asp Gly Thr Tyr Ser Lys Tyr 
15 10 



<210> SEQ ID NO 43 
<211> LENGTH : 12 
<212> TYPE : PRT 

<213> ORGANISM: Artificial Seqaenca 
<220> FEATURE: 

<223> OTHER INFORMATION: Consensus sequence derived from Leuconostoc 

me sent ero ides 
<220> FEATURE: 

<221> NAME/KEY: misc_f eature 
<222> LOCATION: (4) . . (4) 

<223> OTHER INFORMATION : Xaa can be any naturally occurring amino acid 
<220> FEATURE: 

<221> NAME /KEY : misc_f eature 
<222> LOCATION: <6) . . (6) 

<223> OTHER INFORMATION: Xaa can be any naturally occurring amino acid 
<220> FEATURE: 

<221> NAME /KEY: misc_f eature 
<222> LOCATION: (9) . . (11) 

<223> OTHER INFORMATION : Xaa can be any naturally occurring amino acid 
<400> SEQUENCE: 43 

Tyr Tyr Phe Xaa Ala Xaa Gin Gly Xaa Xaa Xaa Leu 
IS 10 



<210> SEQ ID NO 44 
<211> LENGTH: 16 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic peptide 
<400> SEQUENCE: 44 

Tyr Tyr Phe Asp Asp Lys Gly Asn Gly Glu Tyr Cys Phe Thr Asn Thr 
15 10 15 



<210> SEQ ID NO 45 

<211> LENGTH : 38 

<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc raesenteroides 

<400> SEQUENCE: 45 



Met Phe Met lie Lyo Glu Arg Aon Val Arg Lyo Lyo Leu Tyr Lys ser 
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Gly Lys Ser Trp Val lie Gly Gly Leu lie Leu Ser Thr lie Met Leu 
20 25 30 

Ser Met Thr Ala Thr Ser 
35 



<210> SEQ ID NO 4 6 
<211> LENGTH: 32 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 4 6 

Met Pro Phe Thr Glu Lye Val Met Arg Lys Lys Leu Tyr Lys Val Gly 
15 10 15 

Lys Ser Trp Val Val Gly Gly Val Cys Ala Phe Ala Leu Thr Ala Ser 
20 25 30 



<210> SEQ ID NO 4 7 
<211> LENGTH: 38 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 4 7 

Met Lys Gin Gin Glu Thr Val Thr Arg Lys Lys Tyr Lys Ser Gly Lys 
IS 10 IS 

Val Trp Val Ala Ala Ala Thr Ala Phe Ala Val Leu Gly Val Ser Thr 
20 25 30 

Val Thr Thr Val His Ala 
35 



<210> SEQ ID NO 4 8 
<211> LENGTH: 45 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide ECHO-dir 
<400> SEQUENCE: 4 8 

agttgtatga gagacatgag ggtaatttgt gaccgtaaaa aattg 

<210> SEQ ID NO 4 9 
<211> LENGTH: 49 
<212> TYPE: DNA 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Synthetic oligonucleotide ECHO-inv-del 
<400> SEQUENCE: 4 9 

gtattagtga ataagtattc accattgcat ttatcgtcaa aatagtacg 



<210> SEQ ID NO 50 
<211> LENGTH: 13 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 50 

Ala Ala Lys Val Val Ala Val Ala Thr Thr Pro Ala Thr 
15 10 



<210> SEQ ID NO 51 

<211> LENGTH: 9 

<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
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<400> SEQUENCE: 51 

Pro Val Ala Asp Lys Thr Val Ser Ala 
1 5 



<210> SEQ ID NO 52 
<211> LENGTH: 14 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 52 

Pro Ala Ala Asp Lys Ala Val Asp Thr Thr Ser Ser Thr Thr 
15 10 



<210> SEQ ID NO 53 
<211> LENGTH: 13 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 

<400> SEQUENCE: 53 

Pro Ala Thr Asp Lys Ala Val Asp Thr Thr Pro Thr Thr 
15 10 



<210> SEQ ID NO 54 
<211> LENGTH: 13 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 54 

Pro Ala Ala Asp Lys Ala Val Asp Thr Thr Pro Thr Thr 
1 5 10 



<210> SEQ ID NO 55 
<211> LENGTH: 13 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 55 

Pro Ala Ala Asp Lys Ala Val Asp Thr Thr Pro Thr Thr 
15 10 



<210> SEQ ID NO 56 
<211> LENGTH: 13 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 56 

Pro Ala Ala Asn Lys Ala Val Asp Thr Thr Pro Ala Thr 
15 10 



<210> SEQ ID NO 57 
<211> LENGTH: 12 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE s 57 

Ala Ala Thr Asp Lys Ala Val Ala Thr Pro Ala Thr 
1 5 ■ 10 



<210> SEQ ID NO 58 
<211> LENGTH: 12 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 



<400> SEQUENCE: SB 
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Pro Ala Ala Asp Lys Leu Ala Asn Thr Thr Ala Thr 
15 10 



<210> SEQ ID NO 59 
<211> LENGTH: 10 
<212> TYPE ; PRT 

<213> ORGANISM: LeuconoBtoc mesenteroides 
<400> SEQUENCE: 59 

Asp Lys Ala Val Ala Thr Thr Pro Ala Thr 
15 10 



<210> SEQ ID NO 60 
<211> LENGTH: 7 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 

<400> SEQUENCE: 60 

Pro Val Ala Aon Lys Ala Ala 
1 5 



<210> SEQ ID NO 61 
<211> LENGTH: 13 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc meBenteroides 
<220> PEATURE: 

<221> NAME/KEY: misc_f eature 
<222> LOCATION: (12).. (12) 

<223> OTHER INFORMATION : Xaa can be Ala or lie 
<400> SEQUENCE: 61 

Pro Ala Ala Asp Lys Ala Val Asp Thr Thr Pro Xaa Thr 
1 5 10 



<210> SEQ ID NO 62 
<211> LENGTH: 17 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE : 62 

Ser Ala Trp Asn Ser Asp Ser Glu Lys Pro Phe Abp Asp His Leu Gin 
15 10 15 

Asn 



<210> SEQ ID NO 63 
<211> LENGTH: 24 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 63 

Gly Gly Tyr Glu Phe Leu Leu Ala Asn Asp Val Asp Asn Ser Asn Pro 
15 10 15 

Val Val Gin Ala Glu Gin Leu Asn 

20 



<210> SEQ ID NO 64 
<211> LENGTH: 21 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<4O0> SEQUENCE: 64 

Ala Asn Phe Asp Ser lie Arg Val Asp Ala Val Asp Asn Val Asp Ala 
15 10 15 
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Asp Leu Leu Gin lie 
20 



<210> SEQ ID NO 65 
<211> LENGTH: 12 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroldes 
<400> SEQUENCE: 65 

His Leu Ser lie Leu Glu Ala Trp Ser Asp Asn Asp 
1 5 10 



<210> SEQ ID NO 66 
<211> LENGTH: 15 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesent oroides 
<400> SEQUENCE: 66 

Tyr Ser Phe He Arg Ala His Asp Ser Glu Val Gin Asp Leu He 
15 10 15 



<210> SEQ ID NO 67 
<211> LENGTH: 8 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc raesenteroides 

<400> SEQUENCE: 6 7 

Asp Trp Val Pro Asp Gin Met Tyr 
1 5 



<210> SEQ ID NO 68 
<211> LENGTH: 17 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesent ero ides 
<400> SEQUENCE: 68 

Pro Gin Trp Asn Gly Glu Ser Glu Lys Pro Tyr Asp Asp His Leu Gin 
1 5 10 15 

Asn 



<210> SEQ ID NO 69 
<211> LENGTH: 24 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroideo 
<400> SEQUENCE: 69 

Gly Gly Tyr Glu Leu Leu Leu Ala Asn Asp Val Asp Asn Ser Asn Pro 
15 10 15 

He Val Gin Ala Glu Gin Leu Asn 
20 



<210> SEQ ID NO 70 
<211> LENGTH: 21 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroideo 
<4 00> SEQUENCE: 70 

Ala Asn Phe Asp Ser He Arg Val Asp Ala Val Asp Asn Val Asp Ala 
15 10 15 

Asp Leu Leu Gin He 
20 



<210> SEQ ID NO 71 
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<211> LENGTH: 12 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<4O0> SEQUENCE: 71 

His Val Ser lie Val Glu Ala Trp Ser Asp Asn Asp 
IS 10 



<210> SEQ ID NO 72 
<211> LENGTH: 15 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 72 

Tyr Ser Phe Ala Arg Ala His Asp Ser Glu Val Gin Asp Leu lie 
15 10 15 



<210> SEQ ID NO 73 
<211> LENGTH : 8 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 

<400> SEQUENCE: 73 

Asp Trp Val Pro Asp Gin Met Tyr 
1 5 



<210> SEQ ID NO 74 
<211> LENGTH: 21 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<4 00> SEQUENCE: 74 

Asn Gin Trp Ser lie Ala Ser Glu Asn Glu Thr Val Tyr Pro Asn Gin 
1 5 10 15 

Asp His Met Gin Gly 
20 



<210> SEQ ID NO 75 
<211> LENGTH: 24 
<212> TYPE: PRT 

<213> ORGANISM: Leuconootoc mesenteroides 

<400> SEQUENCE: 75 

Ala Gly Tyr Glu Leu Leu Leu Ala Asn Asp Val Asp Asn Ser Asn Pro 
15 10 15 

Val Val Gin Ala Glu Gin Leu Asn 
20 



<210> SEQ ID NO 76 
<211> LENGTH: 21 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 76 

Ala Asn Phe Asp Gly Val Arg Val Asp Ala Val Asp Asn Val Asn Ala 
15 10 15 

Aop Leu Leu Gin He 
20 



<210> SEQ ID NO 77 

<211> LENGTH: 12 

<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
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<400> SEQUENCE: 77 

His Leu Ser He Leu Glu Ala Trp Ser Gly Asn Asp 
15 10 

<210> SEQ ID NO 78 
<211> LENGTH: 15 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<4 00> SEQUENCE: 78 

Tyr Val Phe He Arg Ala Hia Asp Ser Glu Val Gin Thr Arg He 
15 10 15 

<210> SEQ ID NO 79 
<211> LENGTH: 8 
<212> TYPE; PRT 

<213> ORGANISM: Leuconostoc raesent eroideo 

<400> SEQUENCE: 79 

Asp Leu Val Pro Asn Gin Leu Tyr 
1 5 

<210> SEQ ID NO 80 
<211> LENGTH: 17 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<4 00> SEQUENCE: 80 

Pro Gin Trp Asn Glu Thr Ser Glu Asp Met Ser Asn Asp His Leu Gin 
15 10 15 



<210> SEQ ID NO 81 
<211> LENGTH; 24 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc me sent ero ides 
<400> SEQUENCE: 81 

Gly Gly Phe Glu Leu Leu Leu Ala Asn Asp Val Asp Asn Ser Asn Pro 
1 5 10 15 

Val Val Gin Ala Glu Gin Leu Asn 
20 



<210> SEQ ID NO 82 
<211> LENGTH: 21 
<212> TYPE ; PRT 

<213> ORGANISM: Leuconostoc meaent ero idea 
<4 00> SEQUENCE : 82 

Ala Asn Phe Asp Gly He Arg Val Asp Ala Val Asp Asn Val Asp Ala 
15 10 IS 

Asp Leu Leu Gin He 
20 



<210> SEQ ID HO 83 

<211> LENGTH : 12 

<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc me sent ero ides 

<4 00> SEQUENCE: 83 



His Leu Ser He Leu Glu Asp Trp Ser His Asn Asp 
15 10 
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<210> SEQ ID NO 84 
<211> LENGTH: 15 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 84 

Tyr ser Phe Val Arg Ala His Asp Ser Glu Val Gin Thr Val lie 
15 10 IS 



<210> SEQ ID NO 85 
<211> LENGTH: 8 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 

<400> SEQUENCE: 85 

Asp Trp Val Pro Asp Gin lie Tyr 
1 ~ 5 



<210> SEQ ID NO 86 
<211> LENGTH : 17 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE : 86 

Pro Asn Trp Asn lie Asp Ser Glu Ala Lys Gly Asp Asp His Leu Gin 
1 5 10 15 

Gly 



<210> SEQ ID NO 87 
<211> LENGTH: 24 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 87 

Gly Gly Phe Glu Leu Leu Leu Ala Asn Asp Val Asp Asn Ser Asn Pro 
15 10 15 

Val Val Gin Ala Glu Gin Leu Asn 
20 



<210> SEQ ID NO 88 
<211> LENGTH: 21 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 88 

Ala Asn Phe Asp Gly Tyr Arg Val Asp Ala Val Asp ABn Val Asp Ala 
IS 10 15 

Asp Leu Leu Gin He 
20 



<210> SEQ ID NO 89 
<211> LENGTH: 12 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 89 

He Tyr Gin Phe Trp Lys Thr Gly Glu Met Lys He 
1 5 10 



<210> SEQ ID NO 90 
<211> LENGTH: 15 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
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<400> SEQUENCE: 90 

Tyr Ser Phe He Arg Ala His Asp Ser Glu Val Gin Thr He He 
15 10 15 



<210> SEQ ID NO 91 
<211> LENGTH: 8 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 

<400> SEQUENCE: 91 

Asp Trp Val Pro Asp Gin He Tyr 
1 5 



<210> SEQ ID NO 92 
<211> LENGTH: 17 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<4 00> SEQUENCE: 92 

Pro Gin Trp Asn Met Ser Ser Glu Asp Pro Lys Asn Asp His Leu Gin 
15 10 15 

Asn 



<210> SEQ ID NO 93 
<211> LENGTH: 24 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE : 93 

Gly Gly Phe Glu Leu Leu Leu Ala Asn Asp Val Asp Asn Ser Asn Pro 
15 10 IS 

Val Val Gin Ser Glu Gin Leu Asn 
20 



<210> SEQ ID NO 94 
<211> LENGTH : 21 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<4 00> SEQUENCE: 94 

Ala Asn Phe Asp Gly He Arg Val Asp Ala Val Asp Asn Val Asp Ala 
15 10 15 

Asp Leu Leu Gin He 
20 



<210> SEQ ID NO 95 
<211> LENGTH: 12 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 95 

His Leu Ser He Leu Glu Asp Trp Ser His Asn Asp 
15 10 



<210> SEQ ID NO 96 
<211> LENGTH: 15 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 96 

Tyr Ser Phe Val Arg Ala His Asp Ser Glu Val Gin Thr Val He 
15 10 15 
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<210> SEQ ID NO 97 
<211> LENGTH: 8 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 97 

Asp Trp Val Pro Asp Gin lie Tyr 

1 5 



<210> SEQ 3D NO 98 
<211> LENGTH: 20 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 

<400> SEQUENCE: 98 

Ala Asn Trp Asn Lys Gin Thr Glu Asp Glu Ala Phe Aep Gly Leu Gin 
1 S 10 15 

Trp Leu Gin Gly 
20 



<210> SEQ ID NO 99 
<211> LENGTH: 24 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 99 

Lys Gly Ser Glu Phe Leu Leu Ala Asn Asp lie Asp Asn Ser Asn Pro 
15 10 15 

He Val Gin Ala Glu Gin Leu Asn 
20 



<210> SEQ ID NO 100 
<211> LENGTH: 21 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 100 

Ala Asn Phe Asp Gly He Arg Val Asp Ala Val Asp Asn Val Asp Ala 
15 10 15 

Asp Leu Leu Lys He 
20 



<210> SEQ ID NO 101 
<211> LENGTH: 12 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc raesenceroides 
<400> SEQUENCE: 101 

His Leu Ser He Leu Glu Asp Trp Asn Gly Lys Asp 
IS 10 



<210> SEQ ID NO 102 
<211> LENGTH: IS 
<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc mesenteroides 
<400> SEQUENCE: 102 

Tyr Ser Phe Val Arg Ala His Asp Tyr Asp Ala Gin Asp Pro He 
15 10 15 



<210> SEQ ID NO 103 
<211> LENGTH: 8 
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<212> TYPE: PRT 

<213> ORGANISM: Leuconostoc meoent eroides 

<400> SEQUENCE: 103 

Asp Trp Val Pro Asp Gin lie Tyr 
1 5 



What is claimed is: 

1. An isolated nucleic acid comprising SEQ ID NO:4 or its 
full length complementary strand. 

2. An isolated nucleic acid comprising: 

a) sequence encoding a dextransucrase expressed by the 
plasmid pCR-Ty-dsrD deposited at the CNCM with 
accession number 1-2649; or 

b) a full length complementary sequence to the sequence in 
a). 

3. An expression vector comprising a nucleic acid accord- 
ing to claim 1 or claim 2. 

4. The expression vector according to claim 3, in which the 
nucleic acid is under the control of a sequence allowing its 
expression in prokaryotic or eukaryotic cells. 

5. A host cell transformed by a nucleic acid according to 
claim 1. 

6. A host cell transformed by a vector according to claim 3. 

7. The transformed host cell according to claim 5, selected 
from the group comprising E. coli, Leuconostocci, plants, 
Lactococci and Bacilli or yeasts. 

8. The transformed host cell according to claim 7, wherein 
said transformed host cell is a strain of E. coli deposited at the 
CNCM with accession number 1-2649. 



15 



20 



25 



30 



9. An isolated nucleic acid encoding an enzyme with gly- 
cosyltransferase activity that can form dextrans having 
a(l->2) linkages from saccharose, a-D-fluoroglucose, 
paranitrophenyl-a-D glucopyranoside, a-D-glucopyrano- 
side-a-D sorbofuranoside or 4-O-a-D galactopyranosylsu- 
crose and comprising at least one nucleotide sequence encod- 
ing a catalytic domain of SEQ ID NO:3 and located 3* of a 
sequence encoding a glucan binding domain. 

10. An isolated nucleic acid consisting of SEQ ID NO:4 or 
its full length complementary strand 

11. A host cell transformed by a nucleic acid according to 
claim 9. 

12. A host cell transformed by a nucleic acid according to 
claim 2. 

13 .A host cell transformed by an expression vector accord- 
ing to claim 4. 

14. The isolated nucleic acid molecule according to claim 
9, wherein the nucleotide sequence encoding the glucan bind- 
ing domain is between two nucleotide sequences encoding 
the catalytic domains. 
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MEMORANDUM OPINION 

Plaintiffs here take issue with the interpretation that 
the United States Patent and Trademark Office (PTO) has imposed 
upon 35 U.S.C. § 154, the statute that prescribes patent terms. 
Section 154(a) (2) establishes a term of 20 years from the day on 
which a successful patent application is first filed. Because 
the clock begins to run on this filing date, and not on the day 
the patent is actually granted, some of the effective term of a 
patent is consumed by the time it takes to prosecute the 
application. To mitigate the damage that bureaucracy can do to 
inventors, the statute grants extensions of patent terms for 
certain specified kinds of PTO delay, 35 U.S.C. § 154(b) (1) (A), 
and, regardless of the reason, whenever the patent prosecution 
takes more than three years. 35 U.S.C. § 154(b)(1)(B). 
Recognizing that the protection provided by these separate 
guarantees might overlap, Congress has forbidden double-counting: 
"To the extent that periods of delay attributable to grounds 



specified in paragraph (1) overlap, the period of any adjustment 
granted under this subsection shall not exceed the actual number 
of days the issuance of the patent was delayed." 35 U.S.C. 
§ 154(b) (2) (A) . Plaintiffs claim that the PTO has misconstrued or 
misapplied this provision f and that the PTO is denying them a 
portion of the term Congress has provided for the protection of 
their intellectual property rights. 

Statutory Scheme 
Until 1994, patent terms were 17 years from the date of 
issuance. See 35 U.S.C. § 154 (1992) ("Every patent shall 
contain ... a grant . . . for the term of seventeen .years . . . 
of the right to exclude others from making, using, or selling the 
invention throughout the United States. . . ."). In 1994, in 
order to comply with treaty obligations under the General 
Agreement on Tarrif f s and Trade (GATT) , the statute was amended 
to provide a 20-year term from the date on which the application 
is first filed. See Pub. L. No. 103-465, § 532, 108 Stat. 4809, 
4984 (1994). In 1999, concerned that extended prosecution delays 
could deny inventors substantial portions of their effective 
patent terms under the new regime, Congress enacted the American 
Inventors Protection Act, a portion of which — referred to as 
the Patent Term Guarantee Act of 1999 — provided for the 
adjustments that are at issue in this case. Pub. L . No. 106-113, 
§§ 4401-4402, 113 Stat. 1501, 1501A-557 (1999). 
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As currently codified, 35 U.S.C. § 154(b) provides 
three guarantees of patent term, two of which are at issue here. 
The first is found in subsection (b) (1)(A), the " [g] uarantee of 
prompt Patent and Trademark Office response." It provides a one- 
day extension of patent term for every day that issuance of a 
patent is delayed by a failure of the PTO to comply with various 
enumerated statutory deadlines: fourteen months for a first 
office action; four months to respond to a reply; four months to 
issue a patent after the fee is paid; and the like. See 35 
U.S.C. § 154(b) (1) (A) (i)-(iv). Periods of delay that fit under, 
this provision are called "A delays" or "A periods." The second 
provision is the "[g] uarantee of no more than 3-year application 
pendency." Under this provision, a one-day term extension is 
granted for every day greater than three years after the filing 
date that it takes for the patent to issue, regardless of whether 
the delay is the fault of the PTO. 1 See 35 U.S.C. 
§ 154(b) (1) (B) . The period that begins after the three-year 
window has closed is referred to as the "B delay" or the XX B 
period". ("C delays, " delays resulting from interferences, 
secrecy orders, and appeals, are similarly treated but were not 
involved in the patent applications underlying this suit.) 



1 Certain reasons for exceeding the three-year pendency 
period are excluded, see 35 U.S.C. § 154(b) (1) (b) (i)-(iii), as 
are periods attributable to the applicant's own delay. See 35 
U.S.C. § 154 (b) (2) (C) . 

- 3 - 



The extensions granted for A, B, and C delays are 

subject to the following limitation: 

(A) In general. — To the extent that 
periods of delay attributable to 
grounds specified in paragraph (1) 
overlap, the period of any 
adjustment granted under this 
subsection shall not exceed the 
actual number of days the issuance 
of the patent was delayed. 

35 U.S.C. § 154(b) (2) (A). This provision is manifestly intended 
to prevent double-counting of periods of delay, but understanding 
that intent does not answer the question of what is double- 
counting and what is not. Proper interpretation of this 
proscription against windfall extensions requires an assessment 
of what it means for "periods of delay" to "overlap." 

The PTO, pursuant to its power under 35 U.S.C. 
§ 154(b) (3) (A) to "prescribe regulations establishing procedures 
for the application for and determination of patent term 
adjustments," has issued final rules and an "explanation" of the 
rules, setting forth its authoritative construction of the 
double-counting provision. The rules that the PTO has 
promulgated essentially parrot the statutory text, see 37 C.F.R. 
§ 1.703(f), and so the real interpretive act is found in 
something the PTO calls its Explanation of 37 CFR 1.703(f) and of 
the United States Patent and Trademark Office Interpretation of 
35 U.S.C. § 154(b) (2) (A), which was published on June 21, 2004, 
at 69 Fed. Reg. 34238. Here, the PTO "explained" that: 
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the Office has consistently taken 
the position that if an application 
is entitled to an adjustment under 
the three-year pendency provision 
of 35 O.S.C. § 154(b) (1) (B) , the 
entire period during which the 
application was pending before the 
Office (except for periods excluded 
under 35 U.S.C. § 154(b) (1) (B) 
(i)-(iii)), and not just the period 
beginning three years after the 
actual filing date of the 
application, is the relevant period 
under 35 U.S.C. § 154(b) (1) (B) in 
determining whether periods of 
delay " overlap" under 35 U.S.C. 
154(b) (2) (A) . 

69 Fed. Reg. 34238 (2004) (emphasis added). In short, the PTO' s 
view is that any administrative delay under § 154(b) (1) (A) 
overlaps any 3-year maximum pendency delay under § 154(b) (1) (B) : 
the applicant gets credit for "A delay" or for M B delay, " 
whichever is larger, but never A + B. 

In the plaintiffs' submission, this interpretation does 
not square with the language of the statute. They argue that the 
"A period" and "B period" overlap only if they occur on the same 
calendar day or days. Consider this example, proffered by 
plaintiff: A patent application is filed on 1/1/02. The patent 
issues on 1/1/08, six years later. In that six-year period are 
two "A periods," each one year long: (1) the 14-month deadline 
for first office action is 3/1/03, but the first office action 
does not occur until 3/1/04, one year late; (2) the 4-month 
deadline for patent issuance after payment of the issuance fee is 
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1/1/07, but the patent does not issue until 1/1/08, another year 
of delay attributable to the PTO. According to plaintiff, the "B 
period" begins running on 1/1/05, three years after the patent 
application was filed, and ends three years later, with the 
issuance of the patent on 1/1/08. In this example, then, the 
first "A period" does not overlap the "B period," because it 
occurs in 2003-04, not in 2005-07. The second "A period," which 
covers 365 of the same days covered by the "B period," does 
overlap. Thus, in plaintiff's submission, this patent holder is 
entitled to four years of adjustment (one year of "A period" 
delay + three years of "B period" delay). But in the PTO's view, 
since "the entire period during which the application was pending 
before the office" is considered to be "B period" for purposes of 
identifying "overlap," the patent holder gets only three years of 
adjustment . 

Chevron Deference 

We must first decide whether the PTO' s interpretation 
is entitled to deference under Chevron v. NRDC , 4 67 U.S. 837 
(1984) . No, the plaintiffs argue, because, under the Supreme 
Court's holdings in Gonzales v. Oregon , 546 U.S. 243 (2006), and 
United States v. Mead Corp. , 533 U.S. 218 (2001), Congress has 
not "delegated authority to the agency generally to make rules 
carrying the force of law, " and in any case the interpretation at 
issue here was not promulgated pursuant to any such authority. 
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See Gonzales , 546 U.S. at 255-56, citing Mead , 533 U.S. at 226- 
27. Since at least 1996, the Federal Circuit has held that the 
PTO is not afforded Chevron deference because it does not have 
the authority to issue substantive rules, only procedural 
regulations regarding the conduct of proceedings before the 
agency. See Merck & Co. v. Kessler , 80 F.3d 1543, 1549-50 (Fed. 
Cir. 1996) . 

Here, as in Merck , the authority of the PTO is limited 
to prescribing "regulations establishing procedures for the 
application for and determination of patent term adjustments 
under this subsection." 35 U.S.C. § 154(b)(3)(A) (emphasis 
added) . Indeed, a comparison of this rulemaking authority with 
the authority conferred for a different purpose in the 
immediately preceding section of the statute makes it clear that 
the PTO' s authority to interpret the overlap provision is quite 
limited. In 35 U.S.C. § 154(b) (2) (C) (iii) the PTO is given the 
power to "prescribe regulations establishing the circumstances 
that constitute a failure of an applicant to engage in reasonable 
efforts to conclude processing or examination of an application" 
(emphasis added) — that is, the power to elaborate on the 
meaning of a particular statutory term. No such power is granted 
under § 154(b) (3) (A) . Chevron deference does not apply to the 
interpretation at issue here. 
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Statutory Construction 

Chevron would not save the PTO' s interpretation, 
however, because it cannot be reconciled with the plain text of 
the statute. If the statutory text is not ambiguous enough to 
permit the construction that the agency urges, that construction 
fails at Chevron's "step one/' without regard to whether it is a 
reasonable attempt to reach a result that Congress might have 
intended. See, e.g. , MCI v. AT&T , 512 U.S. 218, 229 (1994) 
("[A]n agency's interpretation of a statute is not entitled to 
deference when it goes beyond the meaning that the statute can 
bear.") . 

The operative question under 35 U.S.C. § 154(b) (2) (A) 
is whether "periods of delay attributable to grounds specified in 
paragraph (1) overlap." The only way that periods of time can 
"overlap" is if they occur on the same day. If an "A delay" 
occurs on one calendar day and a "B delay" 'occurs on another, 
they do not overlap, and § 154(b) (2) (A) does not limit the 
extension to one day. Recognizing this, the PTO defends its 
interpretation as essentially running the "period of delay" under 
subsection (B) from the filing date of the patent application, 
such that a period of "B delay" always overlaps with any periods 
of "A delay" for the purposes of applying § 154 (b) (2) (A) . 

The problem with the PTO' s construction is that it 
considers the application delayed under § 154(b) (1) (B) during the 
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period before it has been delayed . That construction cannot be 
squared with the language of § 154(b) (1) (B) , which applies u if 
the issue of an original patent is delayed due to the failure of 
the United States Patent and Trademark Office to issue a patent 
within 3 years." (Emphasis added.) "B delay" begins when the 
PTO has failed to issue a patent within three years, not before. 

The PTO's interpretation appears to be driven by 
Congress's admonition that any term extension "not exceed the 
actual number of days the issuance of the patent was delayed, " 
and by the PTO's view that "A delays" during the first three 
years of an applications' pendency inevitably lead to "B delays" 
in later years. Thus, as the PTO sees it, if plaintiffs' 
construction is adopted, one cause of delay will be counted 
twice: once because the PTO has failed to meet and administrative 
deadline, and again because that failure has pushed back the 
entire processing of the application into the "B period." 
Indeed, in the example set forth above, plaintiffs' calendar-day 
construction does result in a total effective patent term of 18 
years under the (B) guarantee, so that - again from the PTO's 
viewpoint — the applicant is not "compensated" for the PTO's 
administrative delay, he is benefitted by it. 

But if subsection (B) had been intended to guarantee a 
17-year patent term and no more , it could easily have been 
written that way. It is true that the legislative context — as 
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distinct from the legislative history suggests that Congress 
may have intended to use subsection (B) to guarantee the 17-year 
term provided before GATT. But it chose to write a " [g] uarantee 
of no more than 3-year application pendency , " 35 U.S.C. 
§ 154(b) (1) (B) , not merely a guarantee of 17 effective years of 
patent term, and do so using language separating that guarantee 
from a different promise of prompt administration in subsection 
(A). The PTO' s efforts to prevent windfall extensions may be 
reasonable — they may even be consistent with Congress's 
intent — but its interpretation must square with Congress's 
words. If the outcome commanded by that text is an unintended 
result, the problem is for Congress to remedy, not the agency. 

JAMES ROBERTSON 
United States District Judge 
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UNITED STATES DISTRICT COURT 
FOR THE DISTRICT OF COLUMBIA 



Civil Action No. 07-1492 (JR) 



WYETH, et al., 

Plaintiffs, 

v. 

JON W. DUDAS, Under Secretary of 
Commerce for Intellectual 
Property and Director of U.S. 
Patent and Trademark Office, 

Defendant . 

ORDER 

For the reasons stated in the accompanying memorandum 
opinion, plaintiffs' motion for summary judgment [12] is GRANTED 
and defendant's motion for summary judgment [16] is DENIED. The 
case is remanded to the agency for further proceedings that are 
consistent with this opinion. 



JAMES ROBERTSON 
United States District Judge 



